PA \lT COOPERATION TREAT ) 



From the INTERNATIONAL BUREAU 



PCT 

NOTIFICATION OF THE RECORDING 
OF A CHANGE 

(PCT Rule 92bis.1 and 
Administrative Instructions, Section 422) 


To; 

MORROW, Joy, D. 
Smart & Biggar 
900-55 Metcalfe Street 
P.O. Box 2999 
Station D 

Ottawa, Ontario K1P 5Y6 
CANADA 


Date of mailing (day/month/year) 
20 March 2001 (20.03.01) 


Applicant's or agent's file reference 
77813-1 


IMPORTANT NOTIFICATION 


International application No. 
PCT/CA99/00992 


International filing date (day/month/year) 

28 October 1999 (28.10.99) 



1. The following indications appeared on record concerning: 

X the applicant X the inventor | | the agent | | the common representative 


Name and Address 

MURDIN, Andrew, D. 
146 Rhodes Circle 
Newmarket, Ontario L3X1V2 
Canada 


State of Nationality 
CA 


State of Residence 
CA 


Telephone No. 


Facsimile No. 


Teleprinter No. 


2. The International Bureau hereby notifies the applicant that the following change has been recorded concerning: 
| | the person | [ the name X the address | | the nationality | | the residence 


Name and Address 

MURDIN, Andrew, D. 

11 Forest Hill Drive 

Richmond Hill, Ontario L4B 3C2 

Canada 


State of Nationality 
CA 


State of Residence 
CA 


Telephone No. 


Facsimile No. 


Teleprinter No. 


3. Further observations, if necessary: 


4. A copy of this notification has been sent to: 
| X | the receiving Office | | the designated Offices concerned 
| | the International Searching Authority | X| the elected Offices concerned 
| X| the International Preliminary Examining Authority | | other: 



The International Bureau of WIPO 


Authorized officer 




34, chemin des Colombettes 


Athina Nickitas-Etienne 


1211 Geneva 20, Switzerland 




Facsimile No.: (41-22) 740.14.35 


Telephone No.: (41-22) 338.83.38 


Form PCT/IB/306 (March 1994) 


003908655 



From the INTERNATIONAL BUREAU 



PCT 

NOTIFICATION OF THE RECORDING 
OF A CHANGE 

(PCT Rule 92bis.1 and 
Administrative Instructions, Section 422) 


To: 

MORROW, Joy, D. 

Smart& Biggar < 
900-55 Metcalfe Street 
P.O. Box 2999 
Station D 

Ottawa, Ontario K1P 5Y6 
CANADA 


Date of mailing (day/month/year) 
20 March 2001 (20.03.01) 


Applicant's or agent* s file reference 
77813-1 


IMPORTANT NOTIFICATION 


International application No. 
PCT/CA99/00992 


International filing date (day/month/year) 
28 October 1999 (28.10.99) 



1. The following indications appeared on record concerning: 

X the applicant X the inventor | | the agent ( ] the common representative 


Name and Address 

WANG, Joe 
48 29th Street 

Etobicoke, Ontario M8W 3A8 
Canada 


State of Nationality 
CA 


State of Residence 
CA 


Telephone No. 


Facsimile No. 


Teleprinter No. 


2. The International Bureau hereby notifies the applicant that the following change has been recorded concerning: , 
| | the person | | the name X the address | | the nationality | | the residence 


Name and Address 

WANG, Joe 

51 Aspenwood Drive 

Toronto, Ontario M2H 2E8 

Canada 


State of Nationality 
CA 


State of Residence 
CA 


Telephone No. 


Facsimile No. 


Teleprinter No. 


3. Further observations, if necessary: 


.4. A copy of this notification has been sent to: 
| X| the receiving Office | | the designated Offices concerned 
| [ the International Searching Authority | X| the elected Offices concerned 
| X [ the International Preliminary Examining Authority | | other: 





Authorized officer 


The International Bureau of WIPO 


34, chemin des Colombettes 


Athina Nickitas-Etienne 


1211 Geneva 20, Switzerland 




Facsimile No.: (41-22) 740.14.35 


Telephone No.: (41-22) 338.83.38 


Form PCT/IB/306 (March 1994) 


003908656 



From the INTERNATIONAL BUREAU 



PCT 

NOTIFICATION OF ELECTION 

(PCT Rule 61.2) 


To: 

Assistant Commissioner for Patents 
United States Patent and Trademark 
Office 
Box PCT 

Washington, D.C.20231 
ETATS-UNIS D'AMERIQUE 

in its capacity as elected Office 


Date of mailing (day/month/year) 
17 August 2000 (17.08.00) 




International application No. 
PCT/CA99/00992 


Applicant's or agenfs file reference 
77813-1 


International filing date (day/month/year) 

28 October 1999(28.10.99) 


Priority date (day/month/year) 

28 October 1998(28.10.98) 


Applicant 

MURDIN, Andrew, D. et al 



1. The designated Office is hereby notified of its election made: 

| X| in the demand filed with the International Preliminary Examining Authority on: 
26 May 2000 (26.05.00) 

| | in a notice effecting later election filed with the International Bureau on: 



2. The election | X | was 

| | was not 

made before the expiration of 1 9 months from the priority date or, where Rule 32 applies, within the time limit under 
Rule 32.2(b). 





Authorized officer 




The International Bureau of WIPO 






34, chemin des Colombettes 


F. Baechler 




1211 Geneva 20, Switzerland 






Facsimile No.: (41-22) 740.14.35 


Telephone No.: (41-22) 338.83.38 




Form PCT/IB/331 (July 1992) 




CA9900992 



From the INTERNATIONAL BUREAU 



PCT 

NOTIFICATION OF THE RECORDING 
OF A CHANGE 

(PCT Rule 92bis.1 and 
Administrative Instructions, Section 422) 


To: 

MORROW, Joy, D. 
Smart & Biggar 
900-55 Metcalfe Street 
P.O. Box 2999 
Station D 

Ottawa, Ontario K1P 5Y6 

PANADA 


Date of mailing (day/month/yea r) 
08 November 2000 (08.1 1 .00) 


Applicant's or agenfs file reference 
77813-1 


IMPORTANT NOTIFICATION 


International application No. 
PCT/CA99/00992 


International filing date (day/month/year) 

28 October 1999 (28.10.99) 



1. The following indications appeared on record concerning: 

X the applicant | | the inventor | | the agent. | | the common representative 


Name and Address 

CONNAUGHT LABORATORIES LIMITED ' 
1755 Steeles Avenue West 
Toronto, Ontario M2R 3T4 
Canada 


State of Nationality 
CA 


State of Residence 
CA 


Telephone No. 


Facsimile No. 


Teleprinter No. 


Z. The International Bureau hereby notifies the applicant that the following change has been recorded concerning: 
| | the person X the name | | the address | | the nationality | | the residence 


Name and Address 

AVENTIS PASTEUR LIMITED 
1755 Steeles Avenue West 
Toronto, Ontario M2R 3T4 
Canada 


State of Nationality 
CA 


State of Residence 
CA 


Telephone No. 


Facsimile No. 


Teleprinter No. 


3. Further observations, if necessary: 


4. A copy of this notification has been sent to: 
| X| the receiving Office | | the designated Offices concerned 
| | the International Searching Authority | X| the elected Offices concerned 
| X| the International Preliminary Examining Authority | | other: 



The International Bureau of WIPO 


Authorized officer 


34, chemin des Colombettes 


Jean-Marie McAdams 


121 1 Geneva 20, Switzerland 




Facsimile No.: (41-22) 740.14.35 


Telephone No.: (41-22) 338.83.38 


Form PCT/IB/306 (March 1994) 


003643041 



ENT COOPERATION TREATY 

PCT 



INTERNATIONAL SEARCH REPORT 

(PCT Article 18 and Rules 43 and 44) 



Applicant's or agent's file reference 

77813-1 


FOR FURTHER see Notification of Transmittal of International Search Report 

(Form PCT/ISA/220) as well as, where applicable, item 5 below. 

ACTION 


International application No. 

PCT/ CA 99/ 00992 


International filing date (day/month/year) 

28/10/1999 


(Earliest) Priority Date (day/month/year) 

28/10/1998 


Applicant 

CONNAUGHT LABORATORIES LIMITED et al. 



This International Search Report has been prepared by this International Searching Authority and is transmitted to the applicant 
according to Article 18. A copy is being transmitted to the International Bureau. 



This International Search Report consists of a total of _ 



sheets. 



| | It is also accompanied by a copy of each prior art document cited in this report. 



Basis of the report 

a. With regard to the language, the international search was carried out on the basis of the international application in the 
language in which it was filed, unless otherwise indicated under this item. 

I I the international search was carried out on the basis of a translation of the international application furnished to this 
1 — 1 Authority (Rule 23.1 (b)). 

b. With regard to any nucleotide and/or amino acid sequence disclosed in the international application, the international search 
was carried out on the basis of the sequence listing : 

| | contained in the international application in written form. 

| | filed together with the international application in computer readable form. 

| X [ furnished subsequently to this Authority in written form. 

| X | furnished subsequently to this Authority in computer readble form. 

[X] the statement that the subsequently furnished written sequence listing does not go beyond the disclosure in the 
international application as filed has been furnished. 



the statement that the information recorded in computer readable form is identical to the written sequence listing has been 
furnished 



[ | Certain claims were found unsearchable (See Box I). 
|"X~| Unity of invention is lacking (see Box II). 



4. With regard to the title, 

[X"| the text is approved as submitted by the applicant. 

| | the text has been established by this Authority to read as follows: 



5. With regard to the abstract, 

| X [ the text is approved as submitted by the applicant. 

□ the text has been established, according to Rule 38.2(b), by this Authority as it appears in Box III. The applicant may, 
within one month from the date of mailing of this international search report, submit comments to this Authority. 

6. The figure of the drawings to be published with the abstract is Figure No. 

[ | as suggested by the applicant. [X] None of the figures. 

[ | because the applicant failed to suggest a figure. 

| | because this figure better characterizes the invention. 



Form PCT/1SA/21 0 (first sheet) (July 1998) 



SEARCH REPORT j^P 



|ernational application No. 

INTERNATIOIWSEARCH REPORT — ^ PGT ' CA "' Q °" Z 



B xl Observations wh r certain claims were found unsearchable (Continuation of it ml of first she t) 

This International Search Report has not been established in respect of certain claims under Article 17(2)(a) for the following reasons: 
1. | | Claims Nos.: 



because they relate to subject matter not required to be searched by this Authority, namely: 



Claims Nos.: 

because they relate to parts of the International Application that do not comply with the prescribed requirements to such 
an extent that no meaningful International Search can be carried out, specifically: 



3. Claims Nos.: 

because they are dependent claims and are not drafted in accordance with the second and third sentences of Rule 6.4(a). 

Box II Observations where unity of invention is lacking (Continuation of item 2 of first sheet) 

This International Searching Authority found multiple inventions in this international application, as follows: 

See additional sheet 



I . | I As all required additional search fees were timely paid by the applicant, this International Search Report covers all 
' ' searchable claims. 



2. | | As all searchable claims could be searched without effort justifying an additional fee, this Authority did not invite payment 
of any additional fee. 



3. I I As only some of the required additional search fees were timely paid by the applicant, this International Search Report 
* ' covers only those claims for which fees were paid, specifically claims Nos.: 



4. I y J No required additional search fees were timely paid by the applicant. Consequently, this International Search Report is 
^""^ restricted to the invention first mentioned in the claims; it is covered by claims Nos.: 

1 - 24 (all partially) 



Remark on Protest [ | The additional search fees were accompanied by the applicant's protest. 

j | No protest accompanied the payment of additional search fees. 



Form PCT/ISA/210 (continuation of first sheet (1 )) (July 1998) 



International Application No. PCT/CA 99/00992 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



1. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 1 and 2 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 27 and 
28. Encoded polypeptide, inmnunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
imnunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide . Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



2. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 3 and 4 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 29. 
Encoded polypeptide, immunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide- Modified polypeptide having improved 
immunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide . Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



3. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 5 and 6 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 30. 
Encoded polypeptide, imnunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
irrmunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide . Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



page 1 of 5 



International Application No. PCT/CA 99/00992 



FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



4. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 7 and 8 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 31. 
Encoded polypeptide, immunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
immunogenicity. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide . Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



5. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 9 and 10 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 32. 
Encoded polypeptide, irnnunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
imnunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide. Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



6. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 11 and 12 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 33 and 
34. Encoded polypeptide, immunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
imnunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide. Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
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FURTHER INFORMATION CONTINUED FROM PCT/ISA/ 210 



make use of the nucleic acid, the polypeptide or the 
antibody. 



7. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 13 and 14 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 35 and 
36. Encoded polypeptide, immunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
immunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide. Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



8. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 15 and 16 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 37. 
Encoded polypeptide, immunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
immunogenicity. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide. Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



9. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 17 and 18 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 38 and 
39. Encoded polypeptide, immunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
imnunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide. Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
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stringent conditions to the nucleic molecule. Methods for 

preventing, treating or diagnosing Chlamydia infection which 

make use of the nucleic acid, the polypeptide or the 
antibody. 



10. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 19 and 20 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 40 and 
41. Encoded polypeptide, imnunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
imnunogenicity. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide. Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



11. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 21 and 22 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 42. 
Encoded polypeptide, immunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
immunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide. Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



12. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 23 and 24 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 43. 
Encoded polypeptide, immunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
immunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide. Unicellular host transformed with the nucleic 
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acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 



13. Claims: 1-24 (all partially) 

A nucleic acid molecule comprising a nucleic acid sequence 
as shown in SEQ ID NO: 25 and 26 which encodes a polypeptide 
having an amino acid sequence as shown in SEQ ID NO: 44 and 
45. Encoded polypeptide, imnunogenic fragment thereof and 
antibody directed thereto. Fusion polypeptide comprising 
said polypeptide. Modified polypeptide having improved 
innmunogenici ty. Vaccine, pharmaceutical composition and 
diagnostic kit comprising said nucleic acid sequence or 
polypeptide. Unicellular host transformed with the nucleic 
acid. Method for producing the polypeptide by culturing said 
host. Nucleic acid probe and primer which hybridize under 
stringent conditions to the nucleic molecule. Methods for 
preventing, treating or diagnosing Chlamydia infection which 
make use of the nucleic acid, the polypeptide or the 
antibody. 
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INTERNATIONAL SEARCH REPORT 



International Application No 

|CA 99/00992 



A. CLASSIFICATION OF SUBJECT MATTER _ 

IPC 7 C12N15/62 C07K14/295 C07K16/12 A61K39/118 G01N33/53 
C12Q1/68 C12N5/10 

According to International Patent Classification (IPC) or to both national classification and IPC ^ 



B. FIELDS SEARCHED 



Minimum documentation searched (classification system followed by classification symbols) 

IPC 7 C12N C07K A61K G01N C12Q 



Documentation searched other than minimum documentation to the extent that such documents are included in the fields searched 



Electronic data base consulted during the international search (name of data base and, where practical, search terms used) 



C. DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



p.x 



PEREZ MELGOSA M ET AL: "OUTER MEMBRANE 
COMPLEX PROTEINS OF CHLAMYDIA PNEUMONIAE" 
FEMS MICROBIOLOGY LETTERS, NL, AMSTERDAM, 
vol. 112, no. 2, 

1 September 1993 (1993-09-01), pages 
199-204, XP002057607 

ISSN: 0378-1097 
the whole document 

W0 98 58953 A (MADSEN ANNA S0FIE 
;BIRKELUND SVEND (DK) ; KNUDSEN KATRINE 
(DK); MYG) 30 December 1998 (1998-12-30) 
page 5, line 24 -page 21, line 29 
pages 49-22, SEQ ID NO: 7 and 8; pages 
83-88, SEQ ID NO: 29 and 30 

-/-- 



16,21 



1-24 



Further documents are listed in the continuation of box C. 



Patent family members are listed in annex. 



0 Special categories of cited documents : 

"A" document defining the general state of the art which is not 

considered to be of particular relevance 
"E' earlier document but published on or after the international 

filing date 

"L" document which may throw doubts on priority claim (s) or 
which is cited to establish the publication date of another 
citation or other special reason (as specified) 

"O" document referring to an oral disclosure, use, exhibition or 
other means 

"P" document published prior to the international filing date but 
later than the priority date claimed 



T later document published after the international filing date 
or priority date and not in conflict with the application but 
cited to understand the principle or theory underlying the 
invention 

"X* document of particular relevance; the claimed invention 
cannot be considered novel or cannot be considered to 
involve an inventive step when the document is taken alone 

"Y" document of particular relevance; the claimed invention 

cannot be considered to involve an inventive step when the 
document is combined with one or more other such docu- 
ments, such combination being obvious to a person skilled 
in the art. 

document member of the same patent family 



Date of the actual completion of the international search 

11 April 2000 


Date of mailing of the international search report 

03. 07. 01 


Name and mailing address of the ISA 

European Patent Office, P.B. 5818 Patentlaan 2 
NL - 2280 HV Rijswijk 

Tel. (+31-70) 340-2040, Tx. 31 651 epo nl, 
Fax: (+31-70) 340-3016 


Authorized officer 

Kaas, V 
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C.(Contmuation) DOCUMENTS CONSIDERED TO BE RELEVANT 



Category ° Citation of document, with indication, where appropriate, of the relevant passages 



Relevant to claim No. 



WO 99 27105 A (GRIFFAIS REMY ;GENSET 
3 June 1999 (1999-06-03) 
page 5, line 10 -page 18, line 34 
46, line 4 -page 47, line 13 
6 -page 54, line 30 
34 -page 61, line 22 
18 -page 66, line 3 
36 -page 73, line 31 



(FR)) 



1-24 



page 
page 
page 
page 
page 
pages 



51, 
59, 
63, 
68, 
291 



1 i ne 
1 ine 
1 ine 
line 
611, 



SEQ ID N0:1; pages 630-631 



WIEDMANN-AL-AHMAD, M. : "Reactions of 

polyclonal and neutralizing anti-p54 

monoclonal antibodies with an isolated, 

species-specific 54-ki lodalton protein of 

Chlamydia pneumoniae" 

CLINICAL AND DIAGNOSTIC LABORATORY 

IMMUNOLOGY, 

vol- 4, no. 6, November 1997 (1997-11), 
page 700-704 XP002132124 
cited in the application 
the whole document 

ILJIMA, Y.: "Characterization of 

Chlamydia pneumoniae species-specific 

proteins immunodominant in humans" 

JOURNAL OF CLINICAL MICROBIOLOGY, 

vol. 32, no. 3, March 1994 (1994-03), page 

583-588 XP000881638 

the whole document 
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International Application No 

^^CA 99/00992 


Patent document 
cited in search report 


Publication 
date 


Patent family 
member(s) 


Publication 
date 



WO 9858953 A 30-12-1998 AU 8011998 A 04-01-1999 

EP 1007685 A 14-06-2000 



W0 9927105 A 03-06-1999 AU 1170299 A 15-06-1999 



Form PCT/ISA/210 (patent f amity annex) (July 1992) 



Rld intellectual PROPERTY ORGANIZATION 
International Bureau 




PCT, 

INTERNATIONAL APPLICATION PUBLISHED UNDER THE PATENT COOPERATION TREATY (PCT) 



(51) International Patent Classification 7 
C07K 14/00 



A2 



(11) International Publication Number: 
(43) International Publication Date: 



WO 00/24765 

4 May 2000 (04.05.00) 



(21) International Application Number: PCT/CA99/00992 

(22) International Filing Date: 28 October 1999 (28.10.99) 



(30) Priority Data: 

60/106,034 
60/106,044 
60/106,039 
60/106,042 
60/106,087 
60/106,072 
60/106,073 
60/106,074 
60/106,589 
60/107,034 
60/107,035 
60/106,587 
60/106,588 



28 October 1998 (28.1 0.98) US 

28 October 1998 (28.10.98) US 
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FIELD OF INVENTION 

The present invention relates to Chlamydia antigens 
25 and corresponding DNA molecules, which can be used to prevent 
and treat Chlamydia infection in mammals, such as humans. 

BACKGROUND OF THE INVENTION 

Chlamydiae are prokaryotes. They exhibit morphologic 
30 and structural similarities to gram-negative bacteria including 
a trilaminar outer membrane, which contains lipopolysaccharide 
and several membrane proteins that are structurally and 
functionally analogous to proteins found in £ coli. They are 
obligate intra-cellular parasites with a unique biphasic life 
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cycle consisting of a metabolically inactive but infectious 
extracellular stage and a replicating but non-infectious 
intracellular stage. The replicative stage of the life-cycle 
takes place within a membrane-bound inclusion which sequesters 
5 the bacteria away from the cytoplasm of the infected host cell. 

C. pneumoniae is a common human pathogen, originally 
described as the TWAR strain of Chlamydia psittaci but 
subsequently recognised to be a new species. C. pneumoniae is 
antigenically, genetically and morphologically distinct from 

10 other chlamydia species (C. trachomatis, C. pecorum and C. 
psittaci) . It shows 10% or less DNA sequence homology with 
either of C . trachomatis or C. psittaci. 

C. pneumoniae is a common cause of community acquired 
pneumonia, only less frequent than Streptococcus pneumoniae and 

15 Mycoplasma pneumoniae (Grayston et al. (1995) Journal of 

Infectious Diseases 168:1231; Campos et al. (1995) Investigation 
of Ophthalmology and Visual Science 36:1477). It can also cause 
upper respiratory tract symptoms and disease, including 
bronchitis and sinusitis (Grayston et al. (1995) Journal of 

20 Infectious Diseases 168:1231; Grayston et al (1990) Journal of 
Infectious Diseases 161:618; Marrie (1993) Clinical Infectious 
Diseases. 18:501; Wang et al (1986) Chlamydial infections). 
Cambridge University Press, Cambridge, p. 329The great majority 
of the adult population (over 60%) has antibodies to C. 

25 pneumoniae (Wang et al (1986) Chlamydial infections. Cambridge 
University Press, Cambridge, p. 329), indicating past infection 
which was unrecognized or asymptomatic. 

C. pneumoniae infection usually presents as an acute 
respiratory disease (i.e., cough, sore throat, hoarseness, and 

30 fever; abnormal chest sounds on auscultation) . For most 
i patients, the cough persists for 2 to 6 weeks, and recovery is 

slow. In approximately 10% of these cases, upper respiratory 
tract infection is followed by bronchitis or pneumonia. 
Furthermore, during a C. pneumoniae epidemic, subsequent 
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co-infection with pneumococcus has been noted in about half of 
these pneumonia patients, particularly in the infirm and the 
elderly. As noted above, there is more and more evidence that 
C. pneumoniae infection is also linked to diseases other than 
5 respiratory infections . 

The reservoir for the organism is presumably people. 
In contrast to C. psittaci infections, there is no known bird or 
animal reservoir. Transmission has not been clearly defined. It 
may result from direct contact with secretions, from fomites, or 
10 from airborne spread. There is a long incubation period, which 
may last for many months. Based on analysis of epidemics, C. 
pneumoniae appears to spread slowly through a population (case- 
to-case interval averaging 30 days) because infected persons are 
inefficient transmitters of the organism. Susceptibility to C. 
15 pneumoniae is universal. Reinfections occur during adulthood, 
following the primary infection as a child. C. pneumoniae 
appears to be an endemic disease throughout the world, 
noteworthy for superimposed intervals of increased incidence 
(epidemics) that persist for 2 to 3 years. C . trachomatis 
20 infection does not confer cross-immunity to C. pneumoniae. 
Infections are easily treated with oral antibiotics, 
tetracycline or erythromycin (2 g/d, for at least 10 to 14 d) . A 
recently developed drug, azithromycin, is highly effective as a 
single-dose therapy against chlamydial infections. 
25 In most instances, C. pneumoniae infection is often 

mild and without complications, and up to 90% of infections are 
subacute or unrecognized. Among children in industrialized 
countries, infections have been thought to be rare up to the age 
of 5 y, although a recent study (E Normann et al, Chlamydia 
30 pneumoniae in children with acute respiratory tract infections, 
Acta Paediatrica, 1998, Vol 87, Iss 1, pp 23-27) has reported 
that many children in this age group show PCR evidence of 
infection despite being seronegative, and estimates a prevalence 
of 17-19% in 2-4 y olds. In developing countries, the 
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seroprevalence of C. pneumoniae antibodies among young children 
is elevated, and there are suspicions that C. pneumoniae may be 
an important cause of acute lower respiratory tract disease and 
mortality for infants and children in tropical regions of the 
5 world. 

From seroprevalence studies and studies of local 
epidemics, the initial C. pneumoniae infection usually happens 
between the ages of 5 and 20 y. In the USA, for example, there 
are estimated to be 30,000 cases of childhood pneumonia each 

10 year caused by C. pneumoniae. Infections may cluster among 
groups of children or young adults (e.g., school pupils or 
military conscripts) . 

C. pneumoniae causes 10 to 25% of community-acquired 
lower respiratory tract infections (as reported from Sweden, 

15 Italy, Finland, and the USA) . During an epidemic, C. pneumonia 
infection may account for 50 to 60% of the cases of pneumonia. 
During these periods, also, more episodes of mixed infections 
with 5. pneumoniae have been reported. 
Reinfection during adulthood is common; the clinical 

20 presentation tends to be milder. Based on population 

seroprevalence studies, there tends to be increased exposure 
with age, which is particularly evident among men. Some 
investigators have speculated that a persistent, asymptomatic C. 
pneumoniae infection state is common. 

25 In adults of middle age or older, C. pneumoniae 

infection may progress to chronic bronchitis and sinusitis. A 
study in the USA revealed that the incidence of pneumonia caused 
by C. pneumoniae in persons younger than 60 years is 1 case per 
1,000 persons per year; but in the elderly, the disease 

30 incidence rose three-fold. C. pneumoniae infection rarely leads 
to hospitalization, except in patients with an underlying 
illness . 

Of considerable importance is the association of 
atherosclerosis and C. pneumoniae infection. There are several 
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epidemiological studies showing a correlation of previous 
infections with C. pneumoniae and heart attacks, coronary artery 
and carotid artery disease (Saikku et al. (1988) Lancet ; ii : 983 ; 
Thorn et al. (1992) JAMA 268:68; Linnanmaki et al. (1993), 
5 Circulation 87:1030; Saikku et al. (1992)Annals Internal 
Medicine 116:273; Melnick et al (1993) American Journal of 
Medicine 95:499). Moreover, the organisms has been detected in 
atheromas and fatty streaks of the coronary, carotid, peripheral 
arteries and aorta (Shor et al. (1992) South African. Medical 
10 Journal 82:158; Kuo et al. (1993) Journal of Infectious Diseases 
167:841; Kuo et al. (1993) Arteriosclerosis and Thrombosis 
13:1500; Campbell et al (1995) Journal of Infectious Diseases 
172:585; Chiu et al. Circulation, 1997 (In Press)). Viable C. 
pneumoniae has been recovered from the coronary and carotid 
15 artery (Ramirez et al (1996) Annals of Internal Medicine 

125:979; Jackson et al. Abst. K121, p272, 36 th ICAAC, 15-18 
Sept. 1996, New Orleans) . Furthermore, it has been shown that 
C. pneumoniae can induce changes of atherosclerosis in a rabbit 
model (Fong et al (1997) Journal of Clinical Microbiolology 
20 35:48). Taken together, these results indicate that it is 

highly probable that C. pneumoniae can cause atherosclerosis in 
humans, though the epidemiological importance of chlamydial 
atherosclerosis remains to be demonstrated. 

A number of recent studies have also indicated an 
25 association between C. pneumoniae infection and asthma. 

Infection has been linked to wheezing, asthmatic bronchitis, 
adult-onset asthma and acute exacerbations of asthma in adults, 
and small-scale studies have shown that prolonged antibiotic 
treatment was effective at greatly reducing the severity of the 
30 disease in some individuals (Hahn DL, et al. Evidence for 
Chlamydia pneumoniae infection in steroid-dependent asthma. 
Ann Allergy Asthma Immunol. 1998 Jan; 80(1): 45-49.; Hahn DL, et 
al. Association of Chlamydia pneumoniae IgA antibodies with 
recently symptomatic asthma. Epidemiol Infect. 1996 Dec; 
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117(3): 513-517; Bjornsson E, et al. Serology of chlamydia in 
relation to asthma and bronchial hyperresponsiveness . Scand J 
Infect Dis. 1996; 28(1): 63-69.; Hahn DL. Treatment of Chlamydia 
pneumoniae infection in adult asthma: a before-after trial. J 
5 Fam Pract . 1995 Oct; 41(4): 345-351.; Allegra L, et al. Acute 
exacerbations of asthma in adults: role of Chlamydia pneumoniae 
infection. Eur Respir J. 1994 Dec; 7(12): 2165-2168.; Hahn DL, 
et al. Association of Chlamydia pneumoniae (strain TWAR) 
infection with wheezing, asthmatic bronchitis, and adult-onset 

10 asthma. JAMA. 1991 Jul 10; 266(2): 225-230). 

In light of these results a protective vaccine against 
C. pneumoniae infection would be of considerable importance. 
There is not yet an effective vaccine for any human chlamydial 
infection. It is conceivable that an effective vaccine can be 

15 developed using physically or chemically inactivated Chlamydiae. 
However, such a vaccine does not have a high margin of safety. 
In general, safer vaccines are made by genetically manipulating 
the organism by attenuation or by recombinant means. 
Accordingly, a major obstacle in creating an effective and safe 

20 vaccine against human chlamydial infection has been the paucity 
of genetic information regarding Chlamydia, specifically C. 
pneumoniae. 

Studies with C. trachomatis and C. psittaci indicate 
that safe and effective vaccine against Chlamydia is an 

25 attainable goal. For example, mice which have recovered from a 
lung infection with C. trachomatis are protected from 
infertility induced by a subsequent vaginal challenge (Pal et 
al. (1996) Infection and Immunity . 64 : 534 1 ) . Similarly, sheep 
immunized with inactivated C. psittaci were protected from 

30 subsequent chlamydial-induced abortions and stillbirths (Jones 
et al. (1995) Vaccine 13:715) . Protection from chlamydial 
infections has been associated with Thl immune responses, 
particularly the induction of INFg - producing CD4+T-cells 
(Igietsemes et al. (1993) Immunology 5:317). The adoptive 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 




PCT/CA99/00992 



transfer of CD4+ cell lines or clones to nude or SCID mice 
conferred protection from challenge or cleared chronic disease 
(Igietseme et al (1993) Regional Immunology 5:317; Magee et al 
(1993) Regional Immunology 5: 305), and in vivo depletion of 
5 CD4 + T cells exacerbated disease post-challenge (Landers et al 
(1991) Infection & Immunity 59:3774; Magee et al (1995) 
Infection & Immunity 63:516). However, the presence of 
sufficiently high titres of neutralising antibody at mucosal 
surfaces can also exert a protective effect (Cotter et al. 

10 (1995) Infection and Immunity 63:4704). 

Antigenic variation within the species C. pneumoniae 
is not well documented due to insufficient genetic information, 
though variation is expected to exist based on C. trachomatis . 
Serovars of C. trachomatis are defined on the basis of antigenic 

15 variation in MOMP, but published C. pneumoniae MOMP gene 

sequences show no variation between several diverse isolates of 
the organism (Campbell et al (1990) Infection and Immunity 
58:93; McCafferty et al (1995) Infection and Immunity 63:2387-9; 
Knudsen et al (1996) Third Meeting of the European Society for 

20 Chlamydia Research, Vienna) . Regions of the protein known to be 
conserved in other chlamydial MOMPs are conserved in C. 
pneumoniae (Campbell et al (1990) Infection and Immunity 58:93; 
McCafferty et al (1995) Infection and Immunity 63:2387-9). One 
study has described a strain of C. pneumoniae with a MOMP of 

25 greater that usual molecular weight, but the gene for this has 
not been sequenced (Grayston et al. (1995) Journal of Infectious 
Diseases 168:1231). Partial sequences of outer membrane protein 
2 from nine diverse isolates were also found to be invariant 
(Ramirez et al (1996) Annals of Internal Medicine 125:979). The 

30 genes for HSP60 and HSP70 show little variation from other 

chlamydial species, as would be expected. The gene encoding a 
76kDa antigen has been cloned from a single strain of C. 
pneumoniae. It has no significant similarity with other known 
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chlamydial genes (Marrie (1993) Clinical Infectious Diseases. 
18:501) . 

Many antigens recognised by immune sera to C. 
pneumoniae are conserved across all chlamydiae, but 98kDa, 76 
5 kDa and 54 kDa proteins appear to be C. pneu/noniae-specific (Ref 
Campos et al. (1995) Investigation of Ophthalmology and Visual 
Science 36:1477; Marrie (1993) Clinical Infectious Diseases. 
18:501; Wiedmann-Al-Ahmad M, et al. Reactions of polyclonal and 
neutralizing anti-p54 monoclonal antibodies with an isolated, 

10 species-specific 54-kilodalton protein of Chlamydia pneumoniae. 
Clin Diagn Lab Immunol. 1997 Nov; 4(6): 700-704). 
Immunoblotting of isolates with sera from patients does show 
variation of blotting patterns between isolates, indicating that 
serotypes C. pneumoniae may exist (Ref 1,16). However, the 

15 results are potentially confounded by the infection status of 
the patients, since immunoblot profiles of a patient's sera 
change with time post-infection. An assessment of the number 
and relative frequency of any serotypes, and the defining 
antigens, is not yet possible. 

20 Accordingly, a need exists for identifying and 

isolating polynucleotide sequences of C. pneumoniae for use in 
preventing and treating Chlamydia infection. 

SUMMARY OF THE INVENTION 

25 The present invention provides purified and isolated 

polynucleotide molecules that encode Chlamydia polypeptides 
which can be used in methods to prevent, treat, and diagnose 
Chlamydia infection. In one form of the invention, the 
polynucleotide molecules are selected from DNA that encode 

30 polypeptides CPN100397 (SEQ ID Nos : 1 and 2), CPN100421 (SEQ ID 
Nos: 3 and 4), CPN100422 (SEQ ID Nos: 5 and 6), CPN100424 (SEQ 
ID Nos: 7 and 8), CPN100426 (SEQ ID Nos: 9 and 10), CPN100508 
(SEQ ID Nos: 11 and 12), CPN100515 (SEQ ID Nos: 13 and 14), 
CPN100538 (SEQ ID Nos: 15 and 16), CPN100557 (SEQ ID Nos: 17 and 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 




PCT/CA99/00992 



18), CPN100622 (SEQ ID Nos : 19 and 20), CPN100626 (SEQ ID Nos: 
21 and, 22), CPN100628 (SEQ ID Nos: 23 and 24) and CPN100630 
(SEQ ID Nos: 25 and 26). 

Another form of the invention provides polypeptides 
5 corresponding to the isolated DNA molecules. The amino acid 
sequences of the corresponding encoded polypeptides are shown 
for CPN100397 as SEQ ID Nos: 27 and 28, CPN100421 as SEQ ID No: 
29, CPN100422 as SEQ ID No: 30, CPN100424 as SEQ ID No: 31, 
CPN100426 as SEQ ID No: 32, CPN100508 as SEQ ID Nos: 33 and 34, 

10 CPN100515 as SEQ ID Nos: 35 and 36, CPN100538 as SEQ ID No: 37, 
CPN100557 as SEQ ID Nos: 38 and 39, CPN100622 as SEQ ID Nos: 40 
and 41, CPN100626 as SEQ ID No: 42, CPN100628 as SEQ ID No: 43 
and CPN100630 as SEQ ID Nos: 44 and 45. 

Those skilled in the art will readily understand that the 

15 invention, having provided the polynucleotide sequences encoding 
Chlamydia polypeptides, also provides polynucleotides encoding 
fragments derived from such peptides. Moreover, the invention 
is understood to provide mutants and derivatives of such 
polypeptides and fragments derived therefrom, which result from 

20 the addition, deletion, or substitution of non-essential amino 
acids as described herein. Those skilled in the art would also 
readily understand that the invention, having provided the 
polynucleotide sequences encoding Chlamydia polypeptides, 
further provides monospecific antibodies that specifically bind 

25 to such polypeptides 

The present invention has wide application and includes 
expression cassettes, vectors, and cells transformed or 
transfected with the polynucleotides of the invention. 
Accordingly, the present invention further provides (i) a method 

30 for producing a polypeptide of the invention in a recombinant 
host system and related expression cassettes, vectors, and 
transformed or transfected cells; (ii) a vaccine, or a live 
vaccine vector such as a pox virus, Salmonella typhimurium, or 
Vibrio cholerae vector, containing a polynucleotide of the 
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invention, such vaccines and vaccine vectors being useful for, 
e.g., preventing and treating Chlamydia infection, in 
combination with a diluent or carrier, and related 
pharmaceutical compositions and associated therapeutic and/or 
5 prophylactic methods; (iii) a therapeutic and/or prophylactic 
use of an RNA or DNA molecule of the invention, either in a 
naked form or formulated with a delivery vehicle, a polypeptide 
or combination of polypeptides, or a monospecific antibody of 
the invention, and related pharmaceutical compositions; (iv) a 
10 method for diagnosing the presence of Chlamydia in a biological 
sample, which can involve the use of a DNA or RNA molecule, a 
monospecific antibody, or a polypeptide of the invention; and 
(v) a method for purifying a polypeptide of the invention by 
antibody-based affinity chromatography. 

15 

BRIEF DESCRIPTION OF THE DRAWINGS 

The present invention will be further understood from the 
following description with reference to the drawings, in which: 

Figure 1 shows the nucleotide sequence of the CPN100397 
20 (SEQ ID No: 1 - entire sequence and SEQ ID No: 2 - coding 

sequence) and the deduced amino acid sequence of the CPN100397 
protein from Chlamydia pneumoniae (SEQ ID No: 27 and 28) ♦ 

Figure 2 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100397 gene. 
25 Figure 3 shows the nucleotide sequence of the CPN100421 

(SEQ ID No: 3 - entire sequence and SEQ ID No: 4 - coding 
sequence) and the deduced amino acid sequence of the CPN100421 
protein from Chlamydia pneumoniae (SEQ ID No: 29) . 

Figure 4 shows the restriction enzyme analysis of the 
30 gene encoding the C. pneumoniae CPN100421 gene. 

Figure 5 shows the nucleotide sequence of the CPN100422 
(SEQ ID No: 5 - entire sequence and SEQ ID No: 6 - coding 
sequence) and the deduced amino acid sequence of the CPN100422 
protein from Chlamydia pneumoniae (SEQ ID No: 30) . 
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Figure 6 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100422 gene. 

Figure 7 shows the nucleotide sequence of the CPN100424 
(SEQ ID No: 7 - entire sequence and SEQ ID No: 8 - coding 
5 sequence) and the deduced amino acid sequence of the CPN100424 
protein from Chlamydia pneumoniae (SEQ ID No: 31) . 

Figure 8 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100424 gene. 

Figure 9 shows the nucleotide sequence of the CPN100426 
10 (SEQ ID No: 9 - entire sequence and SEQ ID No: 10 - coding 

sequence) and the deduced amino acid sequence of the CPN100426 
protein from Chlamydia pneumoniae (SEQ ID No: 32) . 

Figure 10 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100426 gene. 
15 Figure 11 shows the nucleotide sequence of the CPN100508 

(SEQ ID No: 11 - entire sequence and SEQ ID No: 12 - coding 
sequence) and the deduced amino acid sequence of the 
CPN100508protein from Chlamydia pneumoniae (SEQ ID No: 33 - full 
length sequence and SEQ ID No: 34 - processed sequence) . 
20 Figure 12 shows the restriction enzyme analysis of the 

gene encoding the C. pneumoniae CPN100508 gene. 

Figure 13 shows the nucleotide sequence of the CPN100515 
(SEQ ID No: 13 - entire sequence and SEQ ID No: 14 - coding 
sequence) and the deduced amino acid sequence of the CPN100515 
25 protein from Chlamydia pneumoniae (SEQ ID No: 35 - full length 
se q uence an d SEQ ID No: 36 - processed sequence) . 

Figure 14 shows the restriction enzyme analysis of the 
g ene encoding the C. pneumoniae CPN100515 gene. 

Figure 15 shows the nucleotide sequence of the CPN100538 
30 (SEQ ID No: 15 - entire sequence and SEQ ID No: 16 - coding 

sequence) and the deduced amino acid sequence of the CPN100538 
protein from Chlamydia pneumoniae (SEQ ID No: 37) . 

Figure 16 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100538 gene. 
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Figure 17 shows the nucleotide sequence of the CPN100557 
(SEQ ID No: 17 - entire sequence and SEQ ID No: 18 - coding 
sequence) and the deduced amino acid sequence of the CPN100557 
protein from Chlamydia pneumoniae (SEQ ID No: 38 - full length 
5 sequence and SEQ ID No: 39 - processed sequence) . 

Figure 18 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100557 gene. 

Figure 19 shows the nucleotide sequence of the CPN100622 
(SEQ ID No: 19 - entire sequence and SEQ ID No: 20 - coding 
10 sequence) and the deduced amino acid sequence of the CPN100622 
protein from Chlamydia pneumoniae (SEQ ID No: 40 - full length 
sequence and SEQ ID No: 41 - processed sequence). 

Figuire 20 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100622 gene. 
15 Figure 21 shows the nucleotide sequence of the CPN100626 

(SEQ ID No: 21 - entire sequence and SEQ ID No: 22 - coding 
sequence) and the deduced amino acid sequence of the CPN100626 
protein from Chlamydia pneumoniae (SEQ ID No: 42) . 

Figure 22 shows the restriction enzyme analysis of the 
20 gene encoding the C. pneumoniae CPN100626 gene. 

Figure 23 shows the nucleotide sequence of the CPN100628 
(SEQ ID No: 23 - entire sequence and SEQ ID No: 24 - coding 
sequence) and the deduced amino acid sequence of the CPN100628 
protein from Chlamydia pneumoniae (SEQ ID No: 43) . 
25 Figure 24 shows the restriction enzyme analysis of the 

gene encoding the C. pneumoniae CPN100628 gene. 

Figure 25 shows the nucleotide sequence of the CPN100630 
(SEQ ID No: 25 - entire sequence and SEQ ID No: 26 - coding 
sequence) and the deduced amino acid sequence of the CPN100630 
30 protein from Chlamydia pneumoniae (SEQ ID No: 4 4 - full length 
sequence and SEQ ID No: 45 - processed sequence) . 

Figure 26 shows the restriction enzyme analysis of the 
gene encoding the C. pneumoniae CPN100630 gene. 
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Figures 27 through 39 show an identification of T and B 
cell epitopes from the amino acid sequences shown in the 
foregoing figures . 

5 DETAILED DESCRIPTION OF INVENTION 

Open reading frames (ORFs) encoding chlamydial 
polypeptides have been identified from the C. pneumoniae genome. 
These polypeptides include polypeptides found permanently in 
the bacterial membrane structure, polypeptides present in the 

10 external vicinity of the bacterial membrane, polypeptides found 
permanently in the inclusion membrane structure, polypeptides 
present in the external vicinity of the inclusion membrane, and 
polypeptides released into the cytoplasm of the infected cell. 
These polypeptides can be used to prevent and treat Chlamydia 

15 infection. 

According to a first aspect of the invention, isolated 
polynucleotides are provided which encode the precursor and 
mature forms of Chlamydia polypeptides, whose amino acid 
sequences are selected from the group consisting of: SEQ ID 

20 Nos: 27 to 45. 

The term "isolated polynucleotide" is defined as a 
polynucleotide removed from the environment in which it 
naturally occurs. For example, a naturally-occurring DNA 
molecule present in the genome of a living bacteria or as part 

25 of a gene bank is not isolated, but the same molecule separated 
from the remaining part of the bacterial genome, as a result of, 
e.g., a cloning event (amplification), is isolated. Typically, 
an isolated DNA molecule is free from DNA regions (e.g., coding 
regions) with which it is immediately contiguous at the 5 1 or 3* 

30 end, in the naturally occurring genome. Such isolated 

polynucleotides may be part of a vector or a composition and 
still be defined as isolated in that such a vector or 
composition is not part of the natural environment of such 
polynucleotide . 
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The polynucleotide of the invention is either RNA or DNA 
(cDNA, genomic DNA, or synthetic DNA), or modifications, 
variants, homologs or fragments thereof. The DNA is either 
double-stranded or single-stranded, and, if single-stranded, is 
5 either the coding strand or the non-coding (anti-sense) strand. 
Any one of the sequences that encode the polypeptides of the 
invention as shown in SEQ ID Nos: 1 to 26 is (a) a coding 
sequence, (b) a ribonucleotide sequence derived from 
transcription of (a) , or (c) a coding sequence which uses the 

10 redundancy or degeneracy of the genetic code to encode the same 
polypeptides. By "polypeptide" or "protein" is meant any chain 
of amino acids, regardless of length or post-translational 
modification (e.g., glycosylation or phosphorylation). Both 
terms are used interchangeably in the present application. 

15 Consistent with the first aspect of the invention, amino 

acid sequences are provided which are homologous to any one of 
SEQ ID Nos: 27 to 45. As used herein, "homologous amino acid 
se q uenC e" is any polypeptide which is encoded, in whole or in 
part, by a nucleic acid sequence which hybridizes at 25-35°C 

20 below critical melting temperature (Tm) , to any portion of the 
nucleic acid sequences of SEQ ID Nos: 1 to 26. A homologous 
amino acid sequence is one that differs from an amino acid 
sequence shown in any one of SEQ ID Nos: 27 to 45 by one or 
more amino acid substitutions. Such a sequence also encompass 

25 serotypic variants (defined below) as well as sequences 
containing deletions or insertions which retain inherent 
characteristics of the polypeptide such as immunogenicity . 
Preferably, such a sequence is at least 75%, more preferably 
80%, and most preferably 90% identical to any one of SEQ ID 

30 Nos: 27 to 45. Homologous amino acid sequences include 

sequences that are identical or substantially identical to SEQ 
ID Nos: 27 to 45. By "amino acid sequence substantially 
identical" is meant a sequence that is at least 90%, preferably 
95%, more preferably 97%, and most preferably 99% identical to 
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an amino acid sequence of reference and that preferably dkffers 
from the sequence of reference by a majority of conservative 
amino acid substitutions. 

Conservative amino acid substitutions are substitutions 
5 among amino acids of the same class. These classes include, for 
example, amino acids having uncharged polar side chains, such as 
asparagine, glutamine, serine, threonine, and tyrosine; amino 
acids having basic side chains, such as lysine, arginine, and 
histidine; amino acids having acidic side chains, such as 
10 aspartic acid and glutamic acid; and amino acids having nonpolar 
side chains, such as glycine, alanine, valine, leucine, 
isoleucine, proline, phenylalanine, methionine, tryptophan, and 
cysteine . 

Homology is measured using sequence analysis software 

15 such as Sequence Analysis Software Package of the Genetics 
Computer Group, University of Wisconsin Biotechnology Center, 
1710 University Avenue, Madison, WI 53705. Amino acid sequences 
are aligned to maximize identity. Gaps may be artificially 
introduced into the sequence to attain proper alignment. Once 

20 the optimal alignment has been set up, the degree of homology is 
established by recording all of the positions in which the amino 
acids of both sequences are identical, relative to the total 
number of positions. 

Homologous polynucleotide sequences are defined in a 

25 similar way. Preferably, a homologous sequence is one that is 
at least 45%, more preferably 60%, and most preferably 85% 
identical to any one of coding sequences SEQ ID Nos : 1 to 26. 

Consistent with the first aspect of the invention, 
polypeptides having a sequence homologous to any one of SEQ ID 

30 Nos: 27 to 45 include naturally-occurring allelic variants, as 
well as mutants or any other non-naturally occurring variants 
that retain the inherent characteristics of the polypeptide of 
SEQ ID Nos: 27 to 45. 
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As is known in the art, an allelic variant is an 
alternate form of a polypeptide that is characterized as having 
a substitution, deletion, or addition of one or more amino acids 
that does not alter the biological function of the polypeptide. 
5 By "biological function" is meant the function of the 

polypeptide in the cells in which it naturally occurs, even if 
the function is not necessary for the growth or survival of the 
cells. For example, the biological function of a porin is to 
allow the entry into cells of compounds present in the 

10 extracellular medium. Biological function is distinct from 
antigenic property. A polypeptide can have more than one 
biological function . 

Allelic variants are very common in nature. For example, 
a bacterial species such as C. pneumoniae, is usually 

15 represented by a variety of strains that differ from each other 
by minor allelic variations. Indeed, a polypeptide that 
fulfills the same biological function in different strains can 
have an amino acid sequence (and polynucleotide sequence) that 
are not identical in each of the strains. Despite this 

20 variation, an immune response directed generally against many 
allelic variants has been demonstrated. In studies of the 
Chlamydial MOMP antigen, cross-strain antibody binding plus 
neutralization of infectivity occurs despite amino acid sequence 
variation of MOMP from strain to strain, indicating that the 

25 MOMP, when used as an immunogen, is tolerant of amino acid 
variations . 

Polynucleotides encoding homologous polypeptides or 
allelic variants are retrieved by polymerase chain reaction 
(PCR) amplification of genomic bacterial DNA extracted by 
30 conventional methods. This involves the use of synthetic 

oligonucleotide primers matching upstream and downstream of the 
5 f and 3 1 ends of the encoding domain. Suitable primers are 
designed according to the nucleotide sequence information 
provided in SEQ ID Nos:l to 26. The procedure is as follows: a 
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primer is selected which consists of 10 to 40, preferably 15 to 
25 nucleotides. It is advantageous to select primers containing 
C and G nucleotides in a proportion sufficient to ensure 
efficient hybridization; i.e., an amount of C and G nucleotides 
5 of at least 40%, preferably 50% of the total nucleotide content. 
An alternative method for retrieving polynucleotides 
encoding homologous polypeptides or allelic variants is by 
hybridization screening of a DNA or RNA library. Hybridization 
procedures are well-known in the art and are described in 

10 Ausubel et al., (Ref 41), Silhavy et al. (Ref 43), and Davis et 
al. (ref 44) . Important parameters for optimizing hybridization 
conditions are reflected in a formula used to obtain the 
critical melting temperature above which two complementary DNA 
strands separate from each other (Ref 45) . For polynucleotides 

15 of about 600 nucleotides or larger, this formula is as follows: 
Tm = 81.5 + 0.5 x (% G+C) + 1.6 log (positive ion concentration) 
- 0.6 x (% formamide) . Under appropriate stringency conditions, 
hybridization temperature (Th) is approximately 20 to 40 °C, 20 
to 25°C, or, preferably 30 to 40°C below the calculated Tm. 

20 Those skilled in the art will understand that optimal 

temperature and salt conditions can be readily determined. 

For the polynucleotides of the invention, stringent 
conditions are achieved for both pre-hybridizing and hybridizing 
incubations (i) within 4-16 hours at 42°C, in 6 x SSC containing 

25 50% formamide, or (ii) within 4-16 hours at 65°C in an aqueous 
6 x SSC solution (1 M NaCl, 0.1 M sodium citrate (pH 7.0)). 

Useful homologs and fragments thereof that do not occur 
naturally are designed using known methods for identifying 
regions of an antigen that are likely to tolerate amino acid 

30 sequence changes and/or deletions. As an example, homologous 
polypeptides from different species are compared; conserved 
sequences are identified. The more divergent sequences are the 
most likely to tolerate sequence changes. Alternatively, 
sequences are modified such that they become more reactive to T- 
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and/or B-cells. (See Table below for identification of T- and 
B- epitopes.) Yet another alternative is to mutate a particular 
amino acid residue or sequence within the polypeptide in vitro, 
then screen the mutant polypeptides for their ability to prevent 
5 or treat Chlamydia infection according to the method outlined 
below . 

A person skilled in the art will readily understand that 
by following the screening process of this invention, it will be 
determined without undue experimentation whether a particular 
10 homolog of any of SEQ ID Nos : 27 to 45 may be useful in the 

prevention or treatment of Chlamydia infection. The screening 
procedure comprises the steps: 

(i) immunizing an animal, preferably mouse, with the 
test homolog or fragment; 
15 (ii) inoculating the immunized animal with Chlamydia; 

and 

(iii) selecting those homologs or fragments which confer 

protection against Chlamydia. 
By "conferring protection" is meant that there is a 
20 reduction is severity of any of the effects of Chlamydia 

infection, in comparison with a control animal which was not 
immunized with the test homolog or fragment. 

It has been previously demonstrated (Yang et. al., 1993) 
that mice are susceptible to intranasal infection with different 
25 isolates of C. pneumoniae. Strain AR-39 (Grayston, 1989) was 
used in Balb/c mice as a challenge infection model to examine 
the capacity of chlamydia gene products delivered as naked DNA 
to elicit a protective response against a sublethal C. 
pneumoniae lung infection. Protective immunity is defined as an 
30 accelerated clearance of pulmonary infection. 

Groups of 7 to 9 week old male Balb/c mice (6 to 10 per 
group) were immunized intramuscularly (i.m.) plus intranasally 
(i.n.) with plasmid DNA containing the coding sequence of a 
C. pneumoniae polypeptide. Saline or the plasmid vector lacking 
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an inserted chlamydial gene was given to groups of control 
animals . 

For i.m. immunization alternate left and right 
quadriceps were injected with lOOjag of DNA in 50\il of PBS on 
5 three occasions at 0, 3 and 6 weeks. For i.n. immunization, 

anaesthetized mice aspirated 50^x1 of PBS containing 50 \iq DNA on 
three occasions at 0, 3 and 6 weeks. At week 8, immunized mice 
were inoculated i.n. with 5 x 10 5 IFU of C. pneumoniae, strain 
AR39 in 100|il of SPG buffer to test their ability to limit the 

10 growth of a sublethal C. pneumoniae challenge. 

Lungs were taken from mice at day 9 post-challenge and 
immediately homogenised in SPG buffer (7.5% sucrose, 5mM 
glutamate, 12.5mM phosphate pH7.5). The homogenate was stored 
frozen at -70°C until assay. Dilutions of the homogenate were 

15 assayed for the presence of infectious chlamydia by inoculation 
onto monolayers of susceptible cells. The inoculum was 
centrifuged onto the cells at 3000rpm for 1 hour, then the cells 
were incubated for three days at 35°C in the presence of 1 jig /ml 
cycloheximide . After incubation the monolayers were fixed with 

20 formalin and methanol then irnmunoperoxidase stained for the 

presence of chlamydial inclusions using convalescent sera from 
rabbits infected with C. pneumoniae and metal-enhanced DAB as a 
peroxidase substrate . 

Consistent with the first aspect of the invention, 

25 polypeptide derivatives are provided that are partial sequences 
of SEQ ID Nos: 27 to 45, partial sequences of polypeptide 
sequences homologous to SEQ ID Nos: 27 to 45, polypeptides 
derived from full-length polypeptides by internal deletion, and 
fusion proteins. 

30 It is an accepted practice in the field of immunology to 

use fragments and variants of protein immunogens as vaccines, as 
all that is required to induce an immune response to a protein 
is a small (e.g., 8 to 10 amino acid) immunogenic region of the 
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protein. Various short synthetic peptides corresponding to 
surface-exposed antigens of pathogens other than Chlamydia have 
been shown to be effective vaccine antigens against their 
respective pathogens, e.g. an 11 residue peptide of murine 
5 mammary tumor virus (Ref 38), a 16-residue peptide of Semliki 
Forest virus (Ref 39) , and two overlapping peptides of 15 
residues each from canine parvovirus (Ref 40) . 

Accordingly, it will be readily apparent to one skilled 
in the art, having read the present description, that partial 

10 sequences of SEQ ID Nos: 27 to 45 or their homologous amino acid 
sequences are inherent to the full-length sequences and are 
taught by the present invention. . Such polypeptide fragments 
preferably are at least 12 amino acids in length. 
Advantageously, they are at least 20 amino acids, preferably at 

15 least 50 amino acids, more preferably at least 75 amino acids, 
and most preferably at least 100 amino acids in length. 

Polynucleotides of 30 to 600 nucleotides encoding partial 
sequences of sequences homologous to SEQ ID Nos: 27 to 45 are 
retrieved by PCR amplification using the parameters outlined 

2 0 above and using primers matching the sequences upstream and 
downstream of the 5' and 3' ends of the fragment to be 
amplified. The template polynucleotide for such amplification 
is either the full length polynucleotide homologous to one of 
SEQ ID Nos: 1 to 26, or a polynucleotide contained in a mixture 

25 of polynucleotides such as a DNA or RNA library. As an 
alternative method for retrieving the partial sequences, 
screening hybridization is carried out under conditions 
described above and using the formula for calculating Tm. Where 
fragments of 30 to 600 nucleotides are to be retrieved, the 

30 calculated Tm is corrected by subtracting ( 600/polynucleotide 

size in base pairs) and the stringency conditions are defined by 
a hybridization temperature that is 5 to 10°C below Tm. Where 
oligonucleotides shorter than 20-30 bases are to be obtained, 
the formula for calculating the Tm is as follows: Tm = 4 x (G+C) 
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+ 2 (A+T) . For example, an 18 nucleotide fragment of 50% G+C 
would have an approximate Tm of 54 °C. Short peptides that are 
fragments of SEQ. ID Nos. 27 to 45 or their homologous 
sequences, are obtained directly by chemical synthesis (E. Gross 
5 and H. J. Meinhofer, 4 The Peptides: Analysis, Synthesis, 

Biology; Modern Techniques of Peptide Synthesis, John Wiley & 
Sons (1981), and M. Bodanzki, Principles of Peptide Synthesis, 
Springer -Verlag (1984)). 

Useful polypeptide derivatives, e.g., polypeptide 

10 fragments, are designed using computer-assisted analysis of 
amino acid sequences. This identifies probable surface- 
exposed, antigenic regions (Ref 37) . An analysis of the 13 
amino acid sequences contained in SEQ ID Nos: 27 to 45, based on 
the product of flexibility and hydrophobicity propensities using 

15 the program SEQSEE (Wishart DS, et al. "SEQSEE: a comprehensive 
program suite for protein sequence analysis." Comput Appl 
Biosci. 1994 Apr; 10 (2) : 121-32) / reveal a number of potential B- 
and T-cell epitopes which may be used as a basis for selecting 
useful immunogenic fragments and variants. The results are 

20 shown in Figures 27 to 39. This analysis uses a reasonable 
combination of external surface features that is likely to be 
recognized by antibodies. Probable T-cell epitopes for HLA- 
A0201 MHC subclass were revealed by an algorithm written at 
Connaught Laboratories that emulates an approach developed at 

25 the NIH (Parker KC, et al. "Peptide binding to MHC class I 
molecules: implications for antigenic peptide prediction 
Immunol Res 1995; 14 ( 1 ): 34-57 ). 

Epitopes which induce a protective T cell-dependent 
immune response are present throughout the length of the 

30 polypeptide. However, some epitopes may be masked by secondary 
and tertiary structures of the polypeptide. To reveal such 
masked epitopes large internal deletions are created which 
remove much of the original protein structure and exposes the 
masked epitopes. Such internal deletions sometimes effects the 
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additional advantage of removing immunodominant regions of high 
variability among strains. Polynucleotides encoding polypeptide 
fragments and polypeptides having large internal deletions are 
constructed using standard methods (Ref 41) . Such methods 
5 include standard PCR, inverse PCR, restriction enzyme treatment 
of cloned DNA molecules, or the method of Kunkel et al. (Ref 
42) . Components for these methods and instructions for their 
use are readily available from various commercial sources such 
as Stratagene. Once the deletion mutants have been constructed, 

10 they are tested for their ability to prevent or treat Chlamydia 
infection as described above. 

As used herein, a fusion polypeptide is one that contains 
a polypeptide or a polypeptide derivative of the invention fused 
at the N- or C-terminal end to any other polypeptide 

15 (hereinafter referred to as a peptide tail) . A simple way to 
obtain such a fusion polypeptide is by translation of an in- 
frame fusion of the polynucleotide sequences, i.e., a hybrid 
gene. The hybrid gene encoding the fusion polypeptide is 
inserted into an expression vector which is used to transform or 

20 transfect a host cell. Alternatively, the polynucleotide 

sequence encoding the polypeptide or polypeptide derivative is 
inserted into an expression vector in which the polynucleotide 
encoding the peptide tail is already present. Such vectors and 
instructions for their use are commercially available, e.g. 

25 the pMal-c2 or pMal-p2 system from New England Biolabs, in 
which the peptide tail is a maltose binding protein, the 
glutathione-S-transf erase system of Pharmacia, or the His-Tag 
system available from Novagen. These and other expression 
systems provide convenient means for further purification of 

30 polypeptides and derivatives of the invention. 

An advantageous example of a fusion polypeptide is one 
where the polypeptide or homolog or fragment of the invention is 
fused to a polypeptide having adjuvant activity, such as subunit 
B of either cholera toxin or E. coli heat-labile toxin. Another 
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advantageous fusion is one where the polypeptide, homolog or 
fragment is fused to a strong T-cell epitope or B-cell epitope. 
Such an epitope may be one known in the art (e.g. the Hepatitis 
B virus core antigen, D.R. Millich et al., " Antibody production 
5 to the nucleocapsid and envelope of the Hepatitis B virus primed 
by a single synthetic T cell site", Nature. 1987. 329:547-549), 
or one which has been identified in another polypeptide of the 
invention (Table ) . Consistent with this aspect of the 
invention is a fusion polypeptide comprising T- or B-cell 

10 epitopes from one of SEQ ID Nos : 27 to 45 or its homolog or 
fragment, wherein the epitopes are derived from multiple 
variants of said polypeptide or homolog or fragment, each 
variant differing from another in the location and sequence of 
its epitope within the polypeptide. Such a fusion is effective 

15 in the prevention and treatment of Chlamydia infection since it 
optimizes the T- and B-cell response to the overall polypeptide, 
homolog or fragment. 

To effect fusion, the polypeptide of the invention is 
fused to the N-, or preferably, to the C-terminal end of the 

20 polypeptide having adjuvant activity or T- or B-cell epitope. 
Alternatively, a polypeptide fragment of the invention is 
inserted internally within the amino acid sequence of the 
polypeptide having adjuvant activity. The T- or B-cell epitope 
may also be inserted internally within the amino acid sequence 

25 of the polypeptide of the invention. 

Consistent with the first aspect, the polynucleotides of 
the invention also encode hybrid precursor polypeptides 
containing heterologous signal peptides, which mature into 
polypeptides of the invention. By "heterologous signal peptide" 

30 is meant a signal peptide that is not found in naturally- 
occurring precursors of polypeptides of the invention. 

A polynucleotide molecule according to the invention, 
including RNA, DNA, or modifications or combinations thereof, 
have various applications. A DNA molecule is used, for example, 
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(i) in a process for producing the encoded polypeptide in a 
recombinant host system, (ii) in the construction of vaccine 
vectors such as poxviruses, which are further used in methods 
and compositions for preventing and/or treating Chlamydia 
5 infection, (iii) as a vaccine agent (as well as an RNA 

molecule) , in a naked form or formulated with a delivery vehicle 
and, (iv) in the construction of attenuated Chlamydia strains 
that can over-express a polynucleotide of the invention or 
express it in a non-toxic, mutated form. 

10 Accordingly, a second aspect of the invention encompasses 

(i) an expression cassette containing a DNA molecule of the 
invention placed under the control of the elements required for 
expression, in particular under the control of an appropriate 
promoter; (ii) an expression vector containing an expression 

15 cassette of the invention; (iii) a procaryotic or eucaryotic 
cell transformed or transfected with an expression cassette 
and/or vector of the invention, as well as (iv) a process for 
producing a polypeptide or polypeptide derivative encoded by a 
polynucleotide of the invention, which involves culturing a 

20 procaryotic or eucaryotic cell transformed or transfected with 
an expression cassette and/or vector of the invention, under 
conditions that allow expression of the DNA molecule of the 
invention and, recovering the encoded polypeptide or polypeptide 
derivative from the cell culture. 

25 A recombinant expression system is selected from 

procaryotic and eucaryotic hosts. Eucaryotic hosts include 
yeast cells (e.g., Saccharomyces cerevisiae or Pichia pastoris) , 
mammalian cells (e.g., C0S1, NIH3T3, or JEG3 cells), arthropods 
cells (e.g., Spodoptera frugiperda (SF9) cells), and plant 

30 cells. A preferred expression system is a procaryotic host such 
as E. coli. Bacterial and eucaryotic cells are available from a 
number of different sources including commercial sources to 
those skilled in the art, e.g., the American Type Culture 
Collection (ATCC; Rockville, Maryland) . Commercial sources of 
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cells used for recombinant protein expression also provide 
instructions for usage of the cells. 

The choice of the expression system depends on the 
features desired for the expressed polypeptide. For example, it 
5 may be useful to produce a polypeptide of the invention in a 
particular lipidated form or any other form- 
One skilled in the art would redily understand that not 
all vectors and expression control sequences and hosts would be 
expected to express equally well the polynucleotides of this 

10 invention. With the guidelines described below, however, a 

selection of vectors, expression control sequences and hosts may 
be made without undue experimentation and without departing from 
the scope of this invention. 

In selecting a vector, the host must be chosen that is 

15 compatible with the vector which is to exist and possibly 

replicate in it. Considerations are made with respect to the 
vector copy number, the ability to control the copy number, 
expression of other proteins such as antibiotic resistance. In 
selecting an expression control sequence, a number of variables 

20 are considered. Among the important variable are the relative 
strength of the sequence (e.g. the ability to drive expression 
under various conditions), the ability to control the sequence's 
function, compatibility between the polynucleotide to be 
expressed and the control sequence (e.g. secondary structures 

25 are considered to avoid hairpin structures which prevent 

efficient transcription) . In selecting the host, unicellular 
hosts are selected which are compatible with the selected 
vector, tolerant of any possible toxic effects of the expressed 
product, able to secrete the expressed product efficiently if 

30 such is desired, to be able to express the product in the 

desired conformation, to be easily scaled up, and to which ease 
of purification of the final product. 

The choice of the expression cassette depends on the 
host system selected as well as the features desired for the 
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expressed polypeptide. Typically, an expression cassette 
includes a promoter that is functional in the selected host 
system and can be constitutive or inducible; a ribosome binding 
site; a start codon (ATG) if necessary; a region encoding a 
5 signal peptide, e.g., a lipidation signal peptide; a DNA 
molecule of the invention; a stop codon; and optionally a 3 f 
terminal region (translation and/or transcription terminator) , 
The signal peptide encoding region is adjacent to the 
polynucleotide of the invention and placed in proper reading 

10 frame. The signal peptide-encoding region is homologous or 

heterologous to the DNA molecule encoding the mature polypeptide 
and is compatible with the secretion apparatus of the host used 
for expression. The open reading frame constituted by the DNA 
molecule of the invention, solely or together with the signal 

15 peptide, is placed under the control of the promoter so that 
transcription and translation occur in the host system. 
Promoters and signal peptide encoding regions are widely known 
and available to those skilled in the art and include, for 
example, the promoter of Salmonella typhimurium (and 

20 derivatives) that is inducible by arabinose (promoter araB) and 
is functional in Gram-negative bacteria such as E. coli (as 
described in U.S. Patent No. 5,028,530 and in Cagnon et al., 
(Ref 46)); the promoter of the gene of bacteriophage T7 encoding 
RNA polymerase, that is functional in a number of E. coli 

25 strains expressing T7 polymerase (described in U.S. Patent 
No. 4 K 952, 4 96); OspA lipidation signal peptide ; and RlpB 
lipidation signal peptide (Ref 47). 

The expression cassette is typically part of an 
expression vector, which is selected for its ability to 

30 replicate in the chosen expression system. Expression vectors 
(e.g., plasmids or viral vectors) can be chosen, for example, 
from those described in Pouwels et al. (Cloning Vectors: A 
Laboratory Manual 1985, Supp. 1987) . Suitable expression 
vectors can be purchased from various commercial sources. 
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Methods for transf orming/transf ecting host cells with 
expression vectors are well-known in the art and depend on the 
host system selected as described in Ausubel et al., (Ref 41). 
Upon expression, a recombinant polypeptide of the 
5 invention (or a polypeptide derivative) is produced and remains 
in the intracellular compartment, is secreted/excreted in the 
extracellular medium or in the periplasmic space, or is embedded 
in the cellular membrane. The polypeptide is recovered in a 
substantially purified form from the cell extract or from the 
10 supernatant after centrif ugation of the recombinant cell 

culture. Typically, the recombinant polypeptide is purified by 
antibody-based affinity purification or by other well-known 
methods that can be readily adapted by a person skilled in the 
art, such as fusion of the polynucleotide encoding the 
15 polypeptide or its derivative to a small affinity binding 

domain. Antibodies useful for purifying by immunoaf f inity the 
polypeptides of the invention are obtained as described below. 

A polynucleotide of the invention can also be useful as a 
vaccine. There are two major routes, either using a viral or 
20 bacterial host as gene delivery vehicle (live vaccine vector) or 
administering the gene in a free form, e.g., inserted into a 
plasmid. Therapeutic or prophylactic efficacy of a 
polynucleotide of the invention is evaluated as described below. 

Accordingly, a third aspect of the invention provides (i) 
25 a vaccine vector such as a poxvirus, containing a DNA molecule 
of the invention, placed under the control of elements required 
for expression; (ii) a composition of matter comprising a 
vaccine vector of the invention, together with a diluent or 
carrier; specifically (iii) a pharmaceutical composition 
30 containing a therapeutically or prophylactically effective 

amount of a vaccine vector of the invention; (iv) a method for 
inducing an immune response against Chlamydia in a mammal (e.g., 
a human; alternatively, the method can be used in veterinary 
applications for treating or preventing Chlamydia infection of 
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animals, e.g., cats or birds), which involves administering to 
the mammal an immunogenically effective amount of a vaccine 
vector of the invention to elicit a protective or therapeutic 
immune response to Chlamydia ; and particularly, (v) a method 
for preventing and/or treating a Chlamydia {e.g., C. 
trachomatis, C. psittaci , C. pneumonia, C. pecorum) infection, 
which involves administering a prophylactic or therapeutic 
amount of a vaccine vector of the invention to an infected 
individual. Additionally, the third aspect of the invention 
encompasses the use of a vaccine vector of the invention in the 
preparation of a medicament for preventing and/or treating 
Chlamydia infection. 

As used herein, a vaccine vector expresses one or several 
polypeptides or derivatives of the invention, as well as at 
least one additional Chlamydia antigen (??), fragment, homolog, 
mutant, or derivative thereof. The vaccine vector may express 
additionally a cytokine, such as interleukin-2 (IL-2) or 
interleukin-12 (IL-12) , that enhances the immune response 
(adjuvant effect) . It is understood that each of the components 
to be expressed is placed under the control of elements required 
for expression in a mammalian cell. 

Consistent with the third aspect of the invention is a 
composition comprising several vaccine vectors, each of them 
capable of expressing a polypeptide or derivative of the 
invention. A composition may also comprise a vaccine vector 
capable of expressing an additional Chlamydia antigen, or a 
subunit, fragment, homolog, mutant, or derivative thereof; or a 
cytokine such as IL-2 or IL-12. 

Vaccination methods for treating or preventing infection 
in a mammal comprises use of a vaccine vector of the invention 
to be administered by any conventional route , particularly to a 
mucosal (e.g., ocular, intranasal, oral, gastric, pulmonary, 
intestinal, rectal, vaginal, or urinary tract) surface or via 
the parenteral {e.g., subcutaneous, intradermal, intramuscular, 
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intravenous, or intraperitoneal) route. Preferred routes depend 
upon the choice of the vaccine vector. Treatment may be 
effected in a single dose or repeated at intervals. The 
appropriate dosage depends on various parameters understood by 
5 skilled artisans such as the vaccine vector itself, the route of 
administration or the condition of the mammal to be vaccinated 
(weight, age and the like) . 

Live vaccine vectors available in the art include viral 
vectors such as adenoviruses and poxviruses as well as bacterial 
10 vectors, e.g., Shigella, Salmonella, Vibrio cholerae, 

Lactobacillus , Bacille bilie de Calmette-Guerin (BCG) , and 
Streptococcus . 

An example of an adenovirus vector, as well as a method 
for constructing an adenovirus vector capable of expressing a 

15 DNA molecule of the invention, are described in U.S. Patent No. 
4,920,209. Poxvirus vectors include vaccinia and canary pox 
virus, described in U.S. Patent No. 4,722,848 and U.S. Patent 
No. 5,364,773, respectively. (Also see, e.g., Tartaglia et al. f 
Virology (1992) 188:217) for a description of a vaccinia virus 

20 vector and Taylor et al, Vaccine (1995) 13:539 for a reference 
of a canary pox.) Poxvirus vectors capable of expressing a 
polynucleotide of the invention are obtained by homologous 
recombination as described in Kieny et al., Nature (1984) 
312:163 so that the polynucleotide of the invention is inserted 

25 in the viral genome under appropriate conditions for expression 
in mammalian cells. Generally, the dose of vaccine viral 
vector, for therapeutic or prophylactic use, can be of from 
about lxlO 4 to about lxlO 11 , advantageously from about lxlO 7 to 
about lxlO 10 , preferably of from about lxlO 7 to about lxlO 9 

30 plaque-forming units per kilogram. Preferably, viral vectors 
are administered parenterally ; for example, in 3 doses, 4 weeks 
apart. It is preferable to avoid adding a chemical adjuvant to 
a composition containing a viral vector of the invention and 
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thereby minimizing the immune response to the viral vector 
itself . 

Non-toxicogenic Vibrio cholerae mutant strains that are 
useful as a live oral vaccine are known. Mekalanos et al., 
5 Nature (1983) 306:551 and U.S. Patent No. 4,882,278 describe 
strains which have a substantial amount of the coding sequence 
of each of the two ctxA alleles deleted so that no functional 
cholerae toxin is produced. WO 92/11354 describes a strain in 
which the irgA locus is inactivated by mutation; this mutation 

10 can be combined in a single strain with ctxA mutations. WO 

94/1533 describes a deletion mutant lacking functional ctxA and 
attRSl DNA sequences. These mutant strains are genetically 
engineered to express heterologous antigens, as described in 
WO 94/19482. An effective vaccine dose of a Vibrio cholerae 

15 strain capable of expressing a polypeptide or polypeptide 

derivative encoded by a DNA molecule of the invention contains 
about 1x10 s to about IxlO 9 , preferably about lxlO 6 to about IxlO 8 , 
viable bacteria in a volume appropriate for the selected route 
of administration. Preferred routes of administration include 

20 all mucosal routes; most preferably, these vectors are 
administered intranasally or orally. 

Attenuated Salmonella typhimurium strains, genetically 
engineered for recombinant expression of heterologous antigens 
or not, and their use as oral vaccines are described in 

25 Nakayama et al. (Bio/Technology (1988) 6:693) and WO 92/11361. 
Preferred routes of administration include all mucosal routes; 
most preferably, these vectors are administered intranasally or 
orally • 

Other bacterial strains used as vaccine vectors in the 
30 context of the present invention are described in High et al., 
EMBO (1992) 11:1991 and Sizemore et al., Science (1995) 270:299 
(Shigella flexneri) ; Medaglini et al. , Proc. Natl. Acad. Sci. 
USA (1995) 92:6868 (Streptococcus gordonii) , Flynn J.L., Cell. 
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Mol. Biol. (1994) 40 (suppl. I): 31, WO 88/6626, WO 90/0594, WO 
91/13157, WO 92/1796, and WO 92/21376 (Bacille Calmette Guerin) . 

In bacterial vectors, the polynucleotide of the invention 
is inserted into the bacterial genome or remains in a free 
5 state as part of a plasmid. 

The composition comprising a vaccine bacterial vector of 
the present invention may further contain an adjuvant . A 
number of adjuvants are known to those skilled in the art. 
Preferred adjuvants are selected from the list provided below. 

10 Accordingly, a fourth aspect of the invention provides 

(i) a composition of matter comprising a polynucleotide of the 
invention, together with a diluent or carrier; (ii) a 
pharmaceutical composition comprising a therapeutically or 
prophylactically effective amount of a polynucleotide of the 

15 invention; (iii) a method for inducing an immune response 
against Chlamydia in a mammal by administration of an 
immunogenically effective amount of a polynucleotide of the 
invention to elicit a protective immune response to Chlamydia; 
and particularly, (iv) a method for preventing and/or treating a 

20 Chlamydia (e.g., C. trachomatis, C. psittaci, C. pneumoniae, or 
C. pecorum) infection, by administering a prophylactic or 
therapeutic amount of a polynucleotide of the invention to an 
infected individual. Additionally, the fourth aspect of the 
invention encompasses the use of a polynucleotide of the 

25 invention in the preparation of a medicament for preventing 

and/or treating Chlamydia infection. A preferred use includes 
the use of a DNA molecule placed under conditions for expression 
in a mammalian cell, especially in a plasmid that is unable to 
replicate in mammalian cells and to substantially integrate in a 

30 mammalian genome. 

Use of the polynucleotides of the invention include their 
administration to a mammal as a vaccine, for therapeutic or 
prophylactic purposes. Such polynucleotides are used in the 
form of DNA as part of a plasmid that is unable to replicate in 
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a mammalian cell and unable to integrate into the mammalian 
genome. Typically, such a DNA molecule is placed under the 
control of a promoter suitable for expression in a mammalian 
cell. The promoter functions either ubiquitously or tissue- 
5 specifically. Examples of non-tissue specific promoters include 
the early Cytomegalovirus (CMV) promoter (described in U.S. 
Patent No. 4,168,062) and the Rous Sarcoma Virus promoter 
(described in Norton & Coffin, Molec. Cell Biol. (1985) 5:281) . 
An example of a tissue-specific promoter is the desmin promoter 

10 which drives expression in muscle cells (Li et al. , Gene (1989) 
78:243, Li & Paulin, J. Biol. Chem. (1991) 266:6562 and Li & 
Paulin, J. Biol. Chem. (1993) 268:10403). Use of promoters is 
well-known to those skilled in the art. Useful vectors are 
described in numerous publications, specifically WO 94/21797 and 

15 Hartikka et al., Human Gene Therapy (1996) 7:1205. 

Polynucleotides of the invention which are used as a 
vaccine encode either a precursor or a mature form of the 
corresponding polypeptide. In the precursor form, the signal 
peptide is either homologous or heterologous. In the latter 

20 case, a eucaryotic leader sequence such as the leader sequence 
of the tissue-type plasminogen factor (tPA) is preferred. 

As used herein, a composition of the invention contains 
one or several polynucleotides with optionally at least one 
additional polynucleotide encoding another Chlamydia antigen 

25 such as urease subunit A, B, or both, or a fragment, derivative, 
mutant, or analog thereof. The composition may also contain an 
additional polynucleotide encoding a cytokine, such as 
interleukin-2 (IL-2) or interleukin-12 (IL-12) so that the 
immune response is enhanced. These additional polynucleotides 

30 are placed under appropriate control for expression. 

Advantageously, DNA molecules of the invention and/or additional 
DNA molecules to be included in the same composition, are 
present in the same plasmid. 
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Standard techniques of molecular biology for preparing 
and purifying polynucleotides are used in the preparation of 
polynucleotide therapeutics of the invention. For use as a 
vaccine, a polynucleotide of the invention is formulated 
5 according to various methods outlined below. 

One method utililizes the polynucleotide in a naked 
form, free of any delivery vehicles. Such a polynucleotide is 
simply diluted in a physiologically acceptable solution such as 
sterile saline or sterile buffered saline, with or without a 

10 carrier* When present, the carrier preferably is isotonic, 

hypotonic, or weakly hypertonic, and has a relatively low ionic 
strength, such as provided by a sucrose solution, e.g., a 
solution cohtaining 20% sucrose. 

An alternative method utilizes the polynucleotide in 

15 association with agents that assist in cellular uptake. 

Examples of such agents are (i) chemicals that modify cellular 
permeability, such as bupivacaine (see, e.g., WO 94/16737), (ii) 
liposomes for encapsulation of the polynucleotide, or 
(iii) cationic lipids or silica, gold, or tungsten 

20 microparticles which associate themselves with the 
polynucleotides . 

Anionic and neutral liposomes are well-known in the art 
(see, e.g., Liposomes: A Practical Approach, RPC New Ed, IRL 
press (1990), for a detailed description of methods for making 

25 liposomes) and are useful for delivering a large range of 

products, including polynucleotides. Cationic lipids are also 
known in the art and are commonly used for gene delivery. Such 
lipids include Lipofectin™ also known as DOTMA (N-[l-(2,3- 
dioleyloxy) propyl] -N, N, N-trimethylammonium chloride) , DOTAP 

30 (1, 2-bis (oleyloxy) -3- ( trimethylammonio) propane) , DDAB 
(dimethyldioctadecylammonium bromide), DOGS 

(dioctadecylamidologlycyl spermine) and cholesterol derivatives 
such as DC-Choi (3 beta- (N- (N 1 , N 1 -dimethyl aminomethane ) - 
carbamoyl) cholesterol) . A description of these cationic lipids 
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can be found in EP 187,702, WO 90/11092, U.S. Patent 
No. 5,283,185, WO 91/15501, WO 95/26356, and U.S. Patent 
No. 5,527,928. Cationic lipids for gene delivery are preferably 
used in association with a neutral lipid such as DOPE (dioleyl 
5 phosphatidylethanolamine) , as described in WO 90/11092 as an 
example . 

Formulation containing cationic liposomes may optionally 
contain other transf ection-f acilitating compounds. A number of 
them are described in WO 93/18759, WO 93/19768, WO 94/25608, and 

10 WO 95/2397. They include spermine derivatives useful for 

facilitating the transport of DNA through the nuclear membrane 
(see, for example, WO 93/18759) and membrane-permeabilizing 
compounds such as GALA, Gramicidine S, and cationic bile salts 
(see, for example, WO 93/19768). 

15 Gold or tungsten microparticles are used for gene 

delivery, as described in WO 91/359, WO 93/17706, and Tang et 
al. (Nature (1992) 356:152). The microparticle-coated 
polynucleotide is injected via intradermal or intraepidermal 
routes using a needleless injection device ("gene gun"), such as 

20 those described in U.S. Patent No. 4,945,050, U.S. Patent 
No. 5,015,580, and WO 94/24263. 

The amount of DNA to be used in a vaccine recipient 
depends, e.g., on the strength of the promoter used in the DNA 
construct, the immunogenicity of the expressed gene product, the 

25 condition of the mammal intended for administration (e.g., the 
weight, age, and general health of the mammal) , the mode of 
administration, and the type of formulation. In general, a 
therapeutically or prophylactically effective dose from about 1 
pg to about 1 mg, preferably, from about 10 pg to about 800 pg 

30 and, more preferably, from about 25 pg to about 250 pg, can be 
administered to human adults. The administration can be 
achieved in a single dose or repeated at intervals. 

The route of administration is any conventional route 
used in the vaccine field. As general guidance, a 
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polynucleotide of the invention is administered via a mucosal 
surface, e.g., an ocular, intranasal, pulmonary, oral, 
intestinal, rectal, vaginal, and urinary tract surface; or via a 
parenteral route, e.g., by an intravenous, subcutaneous, 
intraperitoneal, intradermal, intraepidermal , or intramuscular 
route. The choice of administration route depends on the 
formulation that is selected. A polynucleotide formulated in 
association with bupivacaine is advantageously administered into 
muscles. When a neutral or anionic liposome or a cationic 
lipid, such as DOTMA or DC-Choi, is used, the formulation can be 
advantageously injected via intravenous, intranasal 
(aerosolization) , intramuscular, intradermal, and subcutaneous 
routes. A polynucleotide in a naked form can advantageously be 
administered via the intramuscular, intradermal, or sub- 
cutaneous routes . 

Although not absolutely required, such a composition can 
also contain an adjuvant. If so, a systemic adjuvant that does 
not require concomitant administration in order to exhibit an 
adjuvant effect is preferable such as, e.g., QS21, which is 
described in U.S. Patent No. 5,057,546. 

The sequence information provided in the present 
application enables the design of specific nucleotide probes and 
primers that are used for diagnostic purposes. Accordingly, a 
fifth aspect of the invention provides a nucleotide probe or 
primer having a sequence found in or derived by degeneracy of 
the genetic code from a sequence shown in any one of SEQ ID 
Nos:l to 26. 

The term "probe" as used in the present application 
refers to DNA (preferably single stranded) or RNA molecules (or 
modifications or combinations thereof) that hybridize under the 
stringent conditions, as defined above, to nucleic acid 
molecules having SEQ ID Nos: 1 to 26 or to sequences homologous 
to SEQ ID Nos:l to 26, or to their complementary or anti-sense 
sequences. Generally, probes are significantly shorter than 
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full-length sequences . Such probes contain from about 5 to 
about 100, preferably from about 10 to about 80, nucleotides. 
In particular, probes have sequences that are at least 75%, 
preferably at least 85%, more preferably 95% homologous to a 
5 portion of any of SEQ ID Nos : 1 to 26 or that are complementary 
to such sequences. Probes may contain modified bases such as 
inosine, methyl -5-deoxycytidine, deoxyuridine, dimethylamino-5- 
deoxyuridine, or diamino-2, 6-purine. Sugar or phosphate 
residues may also be modified or substituted. For example, a 

10 deoxyribose residue may be replaced by a polyamide (Nielsen et 
al., Science (1991) 254:1497) and phosphate residues may be 
replaced by ester groups such as diphosphate, alkyl, 
arylphosphonate and phosphorothioate esters. In addition, the 
2 1 -hydroxyl group on ribonucleotides may be modified by 

15 including such groups as alkyl groups. 

Probes of the invention are used in diagnostic tests, as 
capture or detection probes. Such capture probes are 
conventionally immobilized on a solid support, directly or 
indirectly, by covalent means or by passive adsorption. A 

20 detection probe is labelled by a detection marker selected from: 
radioactive isotopes, enzymes such as peroxidase, alkaline 
phosphatase, and enzymes able to hydrolyze a chromogenic, 
fluorogenic, or luminescent substrate, compounds that are 
chromogenic, fluorogenic, or luminescent, nucleotide base 

25 analogs, and biotin. 

Probes of the invention are used in any conventional 
hybridization technique, such as dot blot (Maniatis et al., 
Molecular Cloning: A Laboratory Manual (1982) Cold Spring Harbor 
Laboratory Press, Cold Spring Harbor, New York) , Southern blot 

30 (Southern, J. Mol. Biol. (1975) 98:503), northern blot 

(identical to Southern blot with the exception that RNA is used 
as a target), or the sandwich technique (Dunn et al., Cell 
(1977) 12:23) . The latter technique involves the use of a 
specific capture probe and/or a specific detection probe with 
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nucleotide sequences that at least partially differ from each 
other . 

A primer is a probe of usually about 10 to about 
40 nucleotides that is used to initiate enzymatic polymerization 
5 of DNA in an amplification process (e.g., PCR) , in an elongation 
process, or in a reverse transcription method. Primers used in 
diagnostic methods involving PCR are labeled by methods known in 
the art. 

As described herein, the invention also encompasses (i) a 

10 reagent comprising a probe of the invention for detecting and/or 
identifying the presence of Chlamydia in a biological material; 
(ii) a method for detecting and/or identifying the presence of 
Chlamydia in a biological material, in which (a) a sample is 
recovered or derived from the biological material, (b) DNA or 

15 RNA is extracted from the material and denatured, and (c) 

exposed to a probe of the invention, for example, a capture, 
detection probe or both, under stringent hybridization 
conditions, such that hybridization is detected; and (iii) a 
method for detecting and/or identifying the presence of 

20 Chlamydia in a biological material, in which (a) a sample is 
recovered or derived from the biological material, (b) DNA is 
extracted therefrom, (c) the extracted DNA is primed with at 
least one, and preferably two, primers of the invention and 
amplified by polymerase chain reaction, and (d) the amplified 

25 DNA fragment is produced. 

It is apparent that disclosure of polynucleotide 
se q Uences 0 f SEQ ID Nos: 1 to 26, their homolog, and partial 
sequences of either enable their corresponding amino acid 
sequences. Accordingly, a sixth aspect of the invention 

30 features a substantially purified polypeptide or polypeptide 
derivative having an amino acid sequence encoded by a 
polynucleotide of the invention. 

A "substantially purified polypeptide" as used herein is 
defined as a polypeptide that is separated from the environment 
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in which it naturally occurs and/or that is free of the majority 
of the polypeptides that are present in the environment in which 
it was synthesized. For example, a substantially purified 
polypeptide is free from cytoplasmic polypeptides. Those 
5 skilled in the art would readily understand that the 

polypeptides of the invention may be purified from a natural 
source, i.e., a Chlamydia strain, or produced by recombinant 
means . 

Consistent with the sixth aspect of the invention are 

10 polypeptides, homologs or fragments which are modified or 

treated to enhance their immunogenicity in the target animal, in 
whom the polypeptide, homolog or fragments are intended to 
confer protection against Chlamydia. Such modifications or 
treatments include: amino acid substitutions with an amino acid 

15 derivative such as 3-methyhistidine, 4-hydroxyproline, 5- 

hydroxylysine etc., modifications or deletions which are carried 
out after preparation of the polypeptide, homolog or fragment, 
such as the modification of free amino, carboxyl or hydroxyl 
side groups of the amino acids. 

20 Identification of homologous polypeptides or polypeptide 

derivatives encoded by polynucleotides of the invention which 
have specific antigenicity is achieved by screening for cross- 
reactivity with an antiserum raised against the polypeptide of 
reference having an amino acid sequence of any one of SEQ ID 

25 Nos: 27 to 45. The procedure is as follows: a monospecific 
hyperimmune antiserum is raised against a purified reference 
polypeptide, a fusion polypeptide (for example, an expression 
product of MBP, GST, or His-tag systems), or a synthetic peptide 
predicted to be antigenic. Where an antiserum is raised 

30 against a fusion polypeptide, two different fusion systems are 
employed. Specific antigenicity can be determined according to 
a number of methods, including Western blot (Towbin et al., 
Proc. Natl. Acad. Sci. USA (1979) 76:4350), dot blot, and ELISA, 
as described below. 
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In a Western blot assay, the product to be screened, 
either as a purified preparation or a total E . coll extract, is 
submitted to SDS-Page electrophoresis as described by Laemmli 
(Nature (1970) 227:680). After transfer to a nitrocellulose 
membrane, the material is further incubated with the 
monospecific hyperimmune antiserum diluted in the range of 
dilutions from about 1 : 5 to about 1:5000, preferably from about 
1:100 to about 1:500. Specific antigenicity is shown once a 
band corresponding to the product exhibits reactivity at any of 
the dilutions in the above range. 

In an ELISA assay, the product to be screened is 
preferably used as the coating antigen. A purified preparation 
is preferred, although a whole cell extract can also be used. 
Briefly, about 100 \il of a preparation at about 10 pg protein/ml 
are distributed into wells of a 96-well polycarbonate ELISA 
plate. The plate is incubated for 2 hours at 37 °C then 
overnight at 4°C. The plate is washed with phosphate buffer 
saline (PBS) containing 0.05% Tween 20 (PBS/Tween buffer). The 
wells are saturated with 250 pi PBS containing 1% bovine serum 
albumin (BSA) to prevent non-specific antibody binding. After 1 
hour incubation at 37 °C, the plate is washed with PBS/Tween 
buffer. The antiserum is serially diluted in PBS/Tween buffer 
containing 0.5% BSA. 100 pi of dilutions are added per well. 
The plate is incubated for 90 minutes at 37 °C, washed and 
evaluated according to standard procedures. For example, a goat 
anti-rabbit peroxidase conjugate is added to the wells when 
specific antibodies were raised in rabbits. Incubation is 
carried out for 90 minutes at 37°C and the plate is washed. The 
reaction is developed with the appropriate substrate and the 
reaction is measured by colorimetry (absorbance measured 
spectrophotometrically) . Under the above experimental 
conditions, a positive reaction is shown by O.D. values greater 
than a non immune control serum. 
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In a dot blot assay, a purified product is preferred, 
although a whole cell extract can also be used. Briefly, a 
solution of the product at about 100 pg/ml is serially two-fold 
diluted in 50 mM Tris-HCl (pH 7.5). 100 \il of each dilution are 
5 applied to a nitrocellulose membrane 0.45 ]im set in a 96-well 
dot blot apparatus (Biorad) . The buffer is removed by applying 
vacuum to the system. Wells are washed by addition of 50 mM 
Tris-HCl (pH 7.5) and the membrane is air-dried. The membrane 
is saturated in blocking buffer (50 mM Tris-HCl (pH 7.5) 0.15 M 
10 NaCl, 10 g/L skim milk) and incubated with an antiserum dilution 
from about 1:50 to about 1:5000, preferably about 1:500. The 
reaction is revealed according to standard procedures. For 
example, a goat anti-rabbit peroxidase conjugate is added to the 
wells when rabbit antibodies are used. Incubation is carried 
15 out 90 minutes at 37 °C and the blot is washed. The reaction is 
developed with the appropriate substrate and stopped. The 
reaction is measured visually by the appearance of a colored 
spot, e.g., by colorimetry. Under the above experimental 
conditions, a positive reaction is shown once a colored spot is 
20 associated with a dilution of at least about 1:5, preferably of 
at least about 1:500. 

Therapeutic or prophylactic efficacy of a polypeptide or 
derivative of the invention can be evaluated as described below. 
A seventh aspect of the invention provides (i) a composition of 
25 matter comprising a polypeptide of the invention together with a 
diluent or carrier; specifically (ii) a pharmaceutical 
composition containing a therapeutically or prophylactically 
effective amount of a polypeptide of the invention; (iii) a 
method for inducing an immune response against Chlamydia in a 
30 mammal, by administering to the mammal an immunogenically 

effective amount of a polypeptide of the invention to elicit a 
protective immune response to Chlamydia; and particularly, (iv) 
a method for preventing and/or treating a Chlamydia (e.g., C. 
trachomatis . C. psittaci, C. pneumoniae, or C. pecorum) 
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infection, by administering a prophylactic or therapeutic amount 
of a polypeptide of the invention to an infected individual. 
Additionally, the seventh aspect of the invention encompasses 
the use of a polypeptide of the invention in the preparation of 
5 a medicament for preventing and/or treating Chlamydia infection. 
As used herein, the immunogenic compositions of the 
invention are administered by conventional routes known the 
vaccine field, in particular to a mucosal (e.g., ocular, 
intranasal, pulmonary, oral, gastric, intestinal, rectal, 

10 vaginal, or urinary tract) surface or via the parenteral (e.g., 
subcutaneous, intradermal, intramuscular, intravenous, or 
intraperitoneal) route. The choice of administration route 
depends upon a number of parameters, such as the adjuvant 
associated with the polypeptide. If a mucosal adjuvant is used, 

15 the intranasal or oral route is preferred. If a lipid 

formulation or an aluminum compound is used, the parenteral 
route is preferred with the sub-cutaneous or intramuscular route 
being most preferred. The choice also depends upon the nature 
of the vaccine agent. For example, a polypeptide of the 

20 invention fused to CTB or LTB is best administered to a mucosal 
surface . 

As used herein, the composition of the invention contains 
one or several polypeptides or derivatives of the invention. 
The composition optionally contains at least one additional 

25 Chlamydia antigen, or a subunit, fragment, homolog, mutant, or 
derivative thereof . 

For use in a composition of the invention, a polypeptide 
or derivative thereof is formulated into or with liposomes, 
preferably neutral or anionic liposomes, microspheres, ISCOMS, 

30 or virus-like-particles (VLPs) to facilitate delivery and/or 
enhance the immune response. These compounds are readily 
available to one skilled in the art; for example, see Liposomes: 
A Practical Approach (supra) . 
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Adjuvants other than liposomes and the like are also used 
and- are known in the art. Adjuvants may protect the antigen 
from rapid dispersal by sequestering it in a local deposit, or 
they may contain substances that stimulate the host to secrete 
5 factors that are chemotactic for macrophages and other 

components of the immune system. An appropriate selection can 
conventionally be made by those skilled in the art, for example, 
from those described below. 

Treatment is achieved in a single dose or repeated as 

10 necessary at intervals, as can be determined readily by one 

skilled in the art. For example, a priming dose is followed by 
three booster doses at weekly or monthly intervals. An 
appropriate dose depends on various parameters including the 
recipient (e.g., adult or infant), the particular vaccine 

15 antigen, the route and frequency of administration, the 

presence/absence or type of adjuvant, and the desired effect 
(e.g., protection and/or treatment), as can be determined by one 
skilled in the art. In general, a vaccine antigen of the 
invention is administered by a mucosal route in an amount from 

20 about 10 \ig to about 500 mg, preferably from about 1 mg to about 
200 mg. For the parenteral route of administration, the dose 
usually does not exceed about 1 mg, preferably about 100 jag. 

When used as vaccine agents, polynucleotides and 
polypeptides of the invention may be used sequentially as part 

25 of a multistep immunization process. For example, a mammal is 
initially primed with a vaccine vector of the invention such as 
a pox virus, e.g., via the parenteral route, and then boosted 
twice with the polypeptide encoded by the vaccine vector, e.g., 
via the mucosal route. In another example, liposomes associated 

30 with a polypeptide or derivative of the invention is also used 
for priming, with boosting being carried out mucosally using a 
soluble polypeptide or derivative of the invention in 
combination with a mucosal adjuvant (e.g., LT) . 
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A polypeptide derivative of the invention is also used in 
accordance with the seventh aspect as a diagnostic reagent for 
detecting the presence of anti-Chla/nydia antibodies, e.g., in a 
blood sample. Such polypeptides are about 5 to about 8 0 , 
5 preferably about 10 to about 50 amino acids in length. They are 
either labeled or unlabeled, depending upon the diagnostic 
method. Diagnostic methods involving such a reagent are 
described below. 

Upon expression of a DNA molecule of the invention, a 

10 polypeptide or polypeptide derivative is produced and purified 
using known laboratory techniques. As described above, the 
polypeptide or polypeptide derivative may be produced as a 
fusion protein containing a fused tail that facilitates 
purification. The fusion product is used to immunize a small 

15 mammal, e.g., a mouse or a rabbit, in order to raise antibodies 
against the polypeptide or polypeptide derivative (monospecific 
antibodies) . Accordingly, an eighth aspect of the invention 
provides a monospecific antibody that binds to a polypeptide or 
polypeptide derivative of the invention. 

20 By "monospecific antibody" is meant an antibody that is 

capable of reacting with a unique naturally-occurring Chlamydia 
polypeptide. An antibody of the invention is either polyclonal 
or monoclonal. Monospecific antibodies may be recombinant, 
e.g., chimeric (e.g., constituted by a variable region of murine 

25 origin associated with a human constant region) , humanized (a 
human immunoglobulin constant backbone together with 
hypervariable region of animal, e.g., murine, origin), and/or 
single chain. Both polyclonal and monospecific antibodies may 
also be in the form of immunoglobulin fragments, e.g., F(ab) f 2 

30 or Fab fragments. The antibodies of the invention are of any 
isotype, e.g., IgG or IgA, and polyclonal antibodies are of a 
single isotype or a mixture of isotypes. 

Antibodies against the polypeptides, homologs or 
fragments of the present invention are generated by immunization 
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of a mammal with a composition comprising said polypeptide, 
homolog or fragment. Scu antibodies may be polyclonal or 
monoclonal. Methods to produce polyclonal or monoclonal 
antibodies are well known in the art. For a review, see 
5 "Antibodies, A Laboratory Manual, Cold Spring Harbor Laboratory, 
Eds. E. Harlow and D. Lane (1988), and D.E. Yelton et al., 1981. 
Ann. Rev. Biochem. 50:657-680. For monoclonal antibodies, see 
Kohl and Milstein?... 

The antibodies of the invention, which are raised to a 

10 polypeptide or polypeptide derivative of the invention, are 
produced and identified using standard immunological assays, 
e.g., Western blot analysis, dot blot assay, or ELISA (see, 
e.g., Coligan et al., Current Protocols in Immunology (1994) 
John Wiley & Sons, Inc., New York, NY). The antibodies are used 

15 in diagnostic methods to detect the presence of a Chlamydia 
antigen in a sample, such as a biological sample. The 
antibodies are also used in affinity chromatography for 
purifying a polypeptide or polypeptide derivative of the 
invention. As is discussed further below, such antibodies may 

20 be used in prophylactic and therapeutic passive immunization 
methods . 

Accordingly, a ninth aspect of the invention provides 
(i) a reagent for detecting the presence of Chlamydia in a 
biological sample that contains an antibody, polypeptide, or 

25 polypeptide derivative of the invention; and (ii) a diagnostic 
method for detecting the presence of Chlamydia in a biological 
sample, by contacting the biological sample with an antibody, a 
polypeptide, or a polypeptide derivative of the invention, such 
that an immune complex is formed, and by detecting such complex 

30 to indicate the presence of Chlamydia in the sample or the 
organism from which the sample is derived. 

Those skilled in the art will readily understand that the 
immune complex is formed between a component of the sample and 
the antibody, polypeptide, or polypeptide derivative, whichever 
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is used, and that any unbound material is removed prior to 
detecting the complex. It is understood that a polypeptide 
reagent is useful for detecting the presence of anti-Chlamydia 
antibodies in a sample, e.g., a blood sample, while an antibody 
5 of the invention is used for screening a sample, such as a 
gastric extract or biopsy, for the presence of Chlamydia 
polypeptides . 

For diagnostic applications, the reagent (i.e., the 
antibody, polypeptide, or polypeptide derivative of the 

10 invention) is either in a free state or immobilized on a solid 
support, such as a tube, a bead, or any other conventional 
support used in the field. Immobilization is achieved using 
direct or indirect means. Direct means include passive 
adsorption (non-covalent binding) or covalent binding between 

15 the support and the reagent. By "indirect means" is meant that 
an anti-reagent compound that interacts with a reagent is first 
attached to the solid support. For example, if a polypeptide 
reagent is used, an antibody that binds to it can serve as an 
anti-reagent, provided that it binds to an epitope that is not 

20 involved in the recognition of antibodies in biological samples. 
Indirect means may also employ a ligand-receptor system, for 
example, where a molecule such as a vitamin is grafted onto the 
polypeptide reagent and the corresponding receptor immobilized 
on the solid phase. This is illustrated by the biotin- 

25 streptavidin system. Alternatively, a peptide tail is added 
chemically or by genetic engineering to the reagent and the 
grafted or fused product immobilized by passive adsorption or 
covalent linkage of the peptide tail. 

Such diagnostic agents may be included in a kit which 

30 also comprises instructions for use. The reagent are labeled 
with a detection means which allows for the detection of the 
reagent when it is bound to its target. The detection means may 
be a fluorescent agent such as fluorescein isocyanate or 
fluorescein isothiocyanate, or an enzyme such as horse radish 
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peroxidase or luciferase or alkaline phosphatase, or a 
radioactive element such as 125 I or 51 Cr. 

Accordingly, a tenth aspect of the invention provides a 
process for purifying, from a biological sample, a polypeptide 
5 or polypeptide derivative of the invention, which involves 
carrying out antibody-based affinity chromatography with the 
biological sample, wherein the antibody is a monospecific 
antibody of the invention. 

For use in a purification process of the invention, the 

10 antibody is either polyclonal or monospecific, and preferably is 
of the IgG type. Purified IgGs is prepared from an antiserum 
using standard methods (see, e.g., Coligan et al., supra). 
Conventional chromatography supports, as well as standard 
methods for grafting antibodies, are described in, e.g., 

15 Antibodies: A Laboratory Manual, D. Lane, E. Harlow, Eds. (1988) 
and outlined below. 

Briefly, a biological sample, such as an C. pneumoniae 
extract preferably in a buffer solution, is applied to a 
chromatography material, preferably equilibrated with the buffer 

20 used to dilute the biological sample so that the polypeptide or 
polypeptide derivative of the invention (i.e., the antigen) is 
allowed to adsorb onto the material. The chromatography 
material, such as a gel or a resin coupled to an antibody of the 
invention, is in either a batch form or a column. The unbound 

25 components are washed off and the antigen is then eluted with an 
appropriate elution buffer, such as a glycine buffer or a buffer 
containing a chaotropic agent, e.g., guanidine HC1, or hi.gh salt 
concentration (e.g., 3 M MgCl 2 ) - Eluted fractions are recovered 
and the presence of the antigen is detected, e.g., by measuring 

30 the absorbance at 280 nm. 

An eleventh aspect of the invention provides (i) a 
composition of matter comprising a monospecific antibody of the 
invention, together with a diluent or carrier; (ii) a 
pharmaceutical composition, comprising a therapeutically or 
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prophylactically effective amount of a monospecific antibody of 
the invention, and (iii) a method for treating or preventing a 
Chlamydia (e.g., C. trachomatis, C. psittaci, C. pneumoniae or 
C. pecorum) infection, by administering a therapeutic or 
5 prophylactic amount of a monospecific antibody of the invention 
to an infected individual. Additionally, the eleventh aspect of 
the invention encompasses the use of a monospecific antibody of 
the invention in the preparation of a medicament for treating or 
preventing Chlamydia infection. 

10 The monospecific antibody is either polyclonal or 

monoclonal, preferably of the IgA isotype (predominantly) . In 
passive immunization, the antibody is administered to a mucosal 
surface of a- mammal, e.g., the gastric mucosa, e.g., orally or 
intragastrically , advantageously, in the presence of a 

15 bicarbonate buffer. Alternatively, systemic administration, not 
requiring a bicarbonate buffer, is carried out. A monospecific 
antibody of the invention is administered as a single active 
component or as a mixture with at least one monospecific 
antibody specific for a different Chlamydia polypeptide. The 

20 amount of antibody and the particular regimen used are readily 
determined by one skilled in the art. For example, daily 
administration of about 100 to 1,000 mg of antibodies over one 
week, or three doses per day of about 100 to 1,000 mg of 
antibodies over two or three days, are effective regimens for 

25 most purposes. 

Therapeutic or prophylactic efficacy are evaluated using 
standard methods in the art, e.g., by measuring induction of a 
mucosal immune response or induction of protective and/or 
therapeutic immunity, using, e.g., the C. pneumoniae mouse 

30 model. Those skilled in the art will readily recognize that the 
C. pneumoniae strain of the model may be replaced with another 
Chlamydia strain. For example, the efficacy of DNA molecules 
and polypeptides from C. pneumoniae is preferably evaluated in a 
mouse model using C. pneumoniae strain. Protection is 
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determined by comparing the degree of Chlamydia infection to 
that of a control group. Protection is shown when infection is 
reduced by comparison to the control group. Such an evaluation 
is made for polynucleotides, vaccine vectors, polypeptides and 
5 derivatives thereof, as well as antibodies of the invention. 

Adjuvants useful in any of the vaccine compositions 
described above are as follows. 

Adjuvants for parenteral administration include aluminum 
compounds, such as aluminum hydroxide, aluminum phosphate, and 

10 aluminum hydroxy phosphate. The antigen is precipitated with, 
or adsorbed onto, the aluminum compound according to standard 
protocols. Other adjuvants, such as RIBI (ImmunoChem, Hamilton, 
MT) , is used in parenteral administration. 

Adjuvants for mucosal administration include bacterial 

15 toxins, e.g., the cholera toxin (CT) , the E. coli heat-labile 
toxin (LT) , the Clostridium difficile toxin A and the pertussis 
toxin (PT) , or combinations, subunits, toxoids, or mutants 
thereof such as a purified preparation of native cholera toxin 
subunit B (CTB) . Fragments, homologs, derivatives, and fusions 

20 to any of these toxins are also suitable, provided that they 
retain adjuvant activity. Preferably, a mutant having reduced 
toxicity is used. Suitable mutants are described, e.g., in WO 
95/17211 (Arg-7-Lys CT mutant), WO 96/6627 (Arg-192-Gly LT 
mutant), and WO 95/34323 (Arg-9-Lys and Glu-129-Gly PT mutant). 

25 Additional LT mutants that are used in the methods and 

compositions of the invention include, e.g., Ser-63-Lys, Ala-69- 
Gly, Glu-110-Asp, and Glu-112-Asp mutants. Other adjuvants, 
such as a bacterial monophosphoryl lipid A (MPLA) of, e.g., £. 
coli, Salmonella minnesota , Salmonella typhimurium, or Shigella 

30 flexneri; saponins, or polylactide glycolide (PLGA) 

microspheres, is also be used in mucosal administration. 

Adjuvants useful for both mucosal and parenteral 
administrations include polyphosphazene (WO 95/2415), DC-chol (3 
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b- (N- (N ' , N 1 -dimethyl aminomethane) -carbamoyl ) cholesterol; U.S. 
Patent No. 5,283,185 and WO 96/14831) and QS-21 (WO 88/9336). 

Any pharmaceutical composition of the invention 
containing a polynucleotide, a polypeptide, a polypeptide 
5 derivative, or an antibody of the invention, is manufactured in 
a conventional manner. In particular, it is formulated with a 
pharmaceutically acceptable diluent or carrier, e.g., water or a 
saline solution such as phosphate buffer saline . In general , a 
diluent or carrier is selected on the basis of the mode and 

10 route of administration, and standard pharmaceutical practice. 
Suitable pharmaceutical carriers or diluents, as well as 
pharmaceutical necessities for their use in pharmaceutical 
formulations, are described in Remington ' s Pharmaceutical 
Sciences, a standard reference text in this field and in the 

15 USP/NF. 

The invention also includes methods in which Chlamydia 
infection are treated by oral administration of a Chlamydia 
polypeptide of the invention and a mucosal adjuvant, in 
combination with an antibiotic, an antacid, sucralfate, or a 

20 combination thereof. Examples of such compounds that can be 
administered with the vaccine antigen and the adjuvant are 
antibiotics, including, e.g. , macrolides, tetracyclines, and 
derivatives thereof (specific examples of antibiotics that can 
be used include azithromycin or doxicyclin or immunomodulators 

25 such as cytokines or steroids) . In addition, compounds 

containing more than one of the above-listed components coupled 
together, are used. The invention also includes compositions 
for carrying out these methods, i.e., compositions containing a 
Chlamydia antigen (or antigens) of the invention, an adjuvant, 

30 and one or more of the above-listed compounds, in a 
pharmaceutically acceptable carrier or diluent. 

Amounts of the above-listed compounds used in the methods 
and compositions of the invention are readily determined by one 
skilled in the art. Treatment /immunization schedules are also 
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known and readily designed by one skilled in the art. For 
example, the non-vaccine components can be administered on days 
1-14, and the vaccine antigen + adjuvant can be administered on 
days 7, 14, 21, and 28. 
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CLAIMS 



5 



10 



2. 

15 



20 



A nucleic acid molecule comprising a nucleic acid 
sequence which encodes a polypeptide selected from any of: 

(a) SEQ ID Nos: 27 to 45; 

(b) an immunogenic fragment comprising at least 12 
consecutive amino acids from a polypeptide of (a) ; and 

(c) a polypeptide of (a) or (b) which has been modified to 
improve its immunogenicity , wherein said modified 
polypeptide is at least 75% identical in amino acid 
sequence to the corresponding polypeptide of (a) or 



A nucleic acid molecule comprising a nucleic acid 
sequence selected from any of: 
(a) SEQ ID Nos: 1 to 26; 

<b) a sequence which encodes a polypeptide encoded by any 
one of SEQ ID Nos: 1 to 26; 

(c) a sequence comprising at least 38 consecutive 
nucleotides from any one of the nucleic acid sequences 
of (a) and (b) ; and 

(d) a sequence which encodes a polypeptide which is at 
least 75% identical in amino acid sequence to any one 
of the polypeptides encoded by SEQ ID Nos: 1 to 26. 
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A nucleic acid molecule comprising a nucleic acid 
sequence which encodes a fusion protein, said fusion 
protein comprising a polypeptide encoded by a nucleic acid 
molecule according to claim 1 and an additional 
polypeptide . 

A nucleic acid molecule according to claim 1, 
operatively linked to one or more expression control 
sequences . 



A vaccine comprising at least one first nucleic acid 
according to any one of claims 1 to 4 and a vaccine vector 
wherein each first nucleic acid is expressed as a 
polypeptide, the vaccine optionally comprising a second 
nucleic acid encoding an additional polypeptide which 
enhances the immune response to the polypeptide expressed 
by said first nucleic acid. 



The vaccine of claim 5 wherein the second nucleic aci 
encodes an additional Chlamydia polypeptide. 



A pharmaceutical composition comprising a nucleic aci 
according to any one of claims 1 to 5 and a 
pharmaceutically acceptable carrier . 
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8. A pharmaceutical composition comprising a vaccine 

according to claim 5 or 6 and a pharmaceutically acceptable 
carrier . 

5 9. A unicellular host transformed with the nucleic acid 

molecule of claim 4 . 

10. A nucleic acid probe -of 5 to 100 nucleotides which 
hybridizes under stringent conditions to any one of nucleic 

10 acid molecules of SEQ ID Nos: 1 to 26, or to a homolog or 

complementary or anti-sense sequence of said nucleic acid 
molecule . 

11. A primer of 10 to 40 nucleotides which hybridizes 
15 under stringent conditions to any one of nucleic acid 

molecules of SEQ ID Nos: 1 to 26, or to a homolog or 
complementary or anti-sense sequence of said nucleic acid 
molecule . 

20 12. A polypeptide encoded by a nucleic acid sequence 

according to any one of claims 1 to 4 . 

13. A polypeptide comprising an amino acid sequence 

selected from any of: 
25 (a) SEQ ID Nos: 27 to 45; 
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(b) an immunogenic fragment comprising at least 12 
consecutive amino acids from a polypeptide of (a) ; and 

(c) a polypeptide of (a) or (b) which has been modified to 
improve its immunogenicity , wherein said modified 

5 polypeptide is at least 75% identical in amino acid 

sequence to the corresponding polypeptide of (a) or 
(b) . 



14. A fusion polypeptide comprising a polypeptide of claim 

10 12 or 13 and an additional polypeptide. 



15. A method for producing a polypeptide of claim 12 or 
13, comprising the step of culturing a unicellular host 
according to claim 9. 

15 

16. An antibody against the polypeptide of any one of 
claims 12 to 14 . 



17. A vaccine comprising at least one first polypeptide 

20 according to any one of claims 12 to 14 and a 

pharmaceutically acceptable carrier, optionally comprising 
a second polypeptide which enhances the immune response to 
the first polypeptide. 



25 18. 



The vaccine of claim 17 wherein the second polypeptide 
comprises. an additional Chlamydia polypeptide. 
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A pharmaceutical composition comprising a polypeptide 
according to any one of claims 12 to 14 and a 
pharmaceutically acceptable carrier. 

A pharmaceutical composition comprising a vaccine 
according to claim 17 or 18 and a pharmaceutically 
acceptable carrier . 

A pharmaceutical composition comprising an antibody 
according to claim 16 and a pharmaceutically acceptable 
carrier. 

A method for preventing or treating Chlamydia 
infection using: 

(a) the nucleic acid of any one of claims 1 to 4; 

(b) the vaccine of any one of claims 5, 6, 17 and 18; 

(c) the pharmaceutical composition of any one of claims 7, 
8, 19 to 21; 

(d) the polypeptide of any one of claims 12 to 14; or 

(e) the antibody of claim 16. 

A method of detecting Chlamydia infection comprising 
the step of assaying a body fluid of a mammal to be tested, 
with a component selected from any one of: 
(a) the nucleic acid of any one of claims 1 to 4; 
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(b) the polypeptide of any one of claims 12 to 14; and 

(c) the antibody of claim 16. 



4. A diagnostic kit comprising instructions for use and 

component selected from any one of: 

(a) the nucleic acid of any one of claims 1 to 4; 

(b) the polypeptide of any one of claims 12 to 14; and 
the antibody of claim 16. 
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attttaacgt gcgtatcatt tgtgactaag agatagactt gctttcttta tctatcttct 60 

gtattggaaa gaaagcccct tgagggaaaa aaaggttgtt atg aag att cca etc 115 

Met Lys lie Pro Leu 

cgc ttt tta ttg ata tea tta gta cct acg ctt tct atg teg aat tta 163 
Arg Phe Leu Leu lie Ser Leu Val Pro Thr Leu Ser Met Ser Asn Leu 
10 15 20 

tta gga get get act ace gaa gag tta teg get age aat age ttc gat 
Leu Gly Ala Ala Thr Thr Glu Glu Leu Ser Ala Ser Asn Ser Phe Asp 
25 30 35 

gga act aca tea aca aca age ttt tct agt aaa aca tea teg get aca 
Gly Thr Thr Ser Thr Thr Ser Phe Ser Ser Lys Thr Ser Ser Ala Thr 
40 45 50 

gat ggc ace aat tat gtt ttt aaa gat tct gta gtt ata gaa aat gta 
Asp Gly Thr Asn Tyr Val Phe Lys Asp Ser Val Val lie Glu Asn Val 
55 60 65 

ccc aaa aca ggg gaa act cag tct act agt tgt ttt aaa aat gac get 
Pro Lys Thr Gly Glu Thr Gin Ser Thr Ser Cys Phe Lys Asn Asp Ala 
70 75 80 85 

gca get gga gat eta aat ttc tta gga ggg gga ttt tct ttc aca ttt 403 
Ala Ala Gly Asp Leu Asn Phe Leu Gly Gly Gly Phe Ser Phe Thr Phe 
90 95 100 



age aat ate gat gca ace acg get tct gga get get att gga agt gaa 
Ser Asn He Asp Ala Thr Thr Ala Ser Gly Ala Ala He Gly Ser Glu 
105 HO H5 

gca get aat aag aca gtc acg tta tea gga ttt teg gca ctt tct ttt 
Ala Ala Asn Lys Thr Val Thr Leu Ser Gly Phe Ser Ala Leu Ser Phe 
120 125 130 

ctt aaa tec cca gca agt aca gtg act aat gga ttg gga get ate aat 
Leu Lys Ser Pro Ala Ser Thr Val Thr Asn Gly Leu Gly Ala He Asn 
135 140 145 

gtt aaa ggg aat tta age eta ttg gat aat gat aag gta ttg att cag 
Val Lys Gly Asn Leu Ser Leu Leu Asp Asn Asp Lys Val Leu He Gin 
150 155 160 165 

gac aat ttc tea aca gga gat ggc gga gca att aat tgt gca ggc tec 
Asp Asn Phe Ser Thr Gly Asp Gly Gly Ala He Asn Cys Ala Gly Ser 
170 175 180 

ttg aag ate gca aac aat aag tec ctt tct ttt att gga aat agt tct 
Leu Lys He Ala Asn Asn Lys Ser Leu Ser Phe He Gly Asn Ser Ser 
185 190 195 



211 



259 



307 



355 



451 



499 



547 



595 



643 



691 
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Fig. 1 (con't) 

tea aca cgt 
Ser Thr Arg 
200 

ggt ggg gaa 
Gly Gly Glu 
215 

aaa gga ggt 
Lys Gly Gly 
230 

gga gac agt 
Gly Asp Ser 



gga acc gtc 
Gly Thr Val 



act gcg tta 
Thr Ala Leu 
280 

att act gta 
lie Thr Val 
295 

age cct gat 
Ser Pro Asp 
310 

tct gga gag 
Ser Gly Glu 



act tct aaa 
Thr Ser Lys 



tta aaa ggt 
Leu Lys Gly 
360 

aac tct aag 
Asn Ser Lys 
375 

gaa agt ate 
Glu Ser lie 
390 



ggc gga gcg 
Gly Gly Ala 



act eta ttt 
Thr Leu Phe 



get ate gcg 
Ala lie Ala 
235 

ggc gac att 
Gly Asp lie 
250 

tct cat agt 
Ser His Ser 
265 

cgt get gcg 
Arg Ala Ala 



aca gga teg 
Thr Gly Ser 



act gga gat 
Thr Gly Asp 
315 

aag etc acg 
Lys Leu Thr 
330 

tta ctt caa 
Leu Leu Gin 
345 

gat gtc gtt 
Asp Val Val 



ttg att atg 
Leu lie Met 



gag tta acg 
Glu Leu Thr 
395 



att cat acc 
He His Thr 
205 

cag ggg aat 
Gin Gly Asn 
220 

att gca gac 
lie Ala Asp 



ate ttt gaa 
He Phe Glu 



get att gat 
Ala He Asp 
270 

caa gga cat 
Gin Gly His 
285 

aca tct gtt 
Thr Ser Val 
300 

aac aaa gag 
Asn Lys Glu 



gag gca gaa 
Glu Ala Glu 



aat gtt get 
Asn Val Ala 
350 

tta agt gcg 
Leu Ser Ala 
365 

gat tta ggg 
Asp Leu Gly 
380 

aat ttg gaa 
Asn Leu Glu 



aaa aac etc 
Lys Asn Leu 



aca gcg cct 
Thr Ala Pro 
225 

tct ggc acc 
Ser Gly Thr 
240 

ggc aat acg 
Gly Asn Thr 
255 

tta gga act 
Leu Gly Thr 



acg ata tac 
Thr He Tyr 



get gat get 
Ala Asp Ala 
305 

tat acg gga 
Tyr Thr Gly 
320 

get aaa gat 
Ala Lys Asp 
335 

ttt aaa aat 
Phe Lys Asn 



aac ggt ttc 
Asn Gly Phe 



acg teg ttg 
Thr Ser Leu 
385 

att aat ata 
He Asn He 
400 



aca eta tct 
Thr Leu Ser 
210 

acg get get 
Thr Ala Ala 



eta tec att 
Leu Ser He 



ata gga get 
He Gly Ala 
260 

age get aag 
Ser Ala Lys 
275 

ttt tat gat 
Phe Tyr Asp 
290 

etc aat att 
Leu Asn He 



acc ata gtc 
Thr He Val 



gag aag aac 
Glu Lys Asn 
340 

ggg act gta 
Gly Thr Val 
355 

tct cag gat 
Ser Gin Asp 
370 

gtt gca aac 
Val Ala Asn 



gac tct etc 
Asp Ser Leu 



tct 739 
Ser 



ggt 787 
Gly 



tct 835 

Ser 

245 

aca 883 
Thr 



ata 931 
He 



ccg 979 
Pro 



aat 1027 
Asn 



ttt 1075 

Phe 

325 

cgc 1123 
Arg 



gtt 1171 
Val 



gca 1219 
Ala 



acc 1267 
Thr 



agg 1315 

Arg 

405 
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Fig. 1 (con't) 

aac ggg aaa aag ata aaa etc agt get gec aca get cag aaa gat att 1363 

Asn Gly Lys Lys lie Lys Leu Ser Ala Ala Thr Ala Gin Lys Asp lie 
410 415 420 

cgt ata gat cgt cct gtt gta ctg gca att age gat gag agt ttt tat 1411 

Arg He Asp Arg Pro Val Val Leu Ala He Ser Asp Glu Ser Phe Tyr 

425 430 435 

caa aat ggc ttt ttg aat gag gac cat tec tat gat ggg att ctt gag 1459 

Gin Asn Gly Phe Leu Asn Glu Asp His Ser Tyr Asp Gly He Leu Glu 

440 445 450 

tta gat get ggg aaa gac ate gtg att tct gca gat tct cgc agt ata 1507 

Leu Asp Ala Gly Lys Asp He Val He Ser Ala Asp Ser Arg Ser He 
455 460 465 

gat get gta caa tct ccg tat ggc tat cag gga aag tgg acg ate aat 1555 

Asp Ala Val Gin Ser Pro Tyr Gly Tyr Gin Gly Lys Trp Thr He Asn 
470 475 480 485 



tgg tct act gat gat aag aaa get acg gtt tct tgg gcg aag cag agt 
Trp Ser Thr Asp Asp Lys Lys Ala Thr Val Ser Trp Ala Lys Gin Ser 
490 495 500 



505 510 515 

tgg ggt tct ttt ata gat gtt cgt tec ttc cag aat ttt ata gag eta 

Trp Gly Ser Phe He Asp Val Arg Ser Phe Gin Asn Phe He Glu Leu 

520 525 530 



gac tac ttt atg aat acc aat ttc gca aag acc tac gca gga tct tta 
Asp Tyr Phe Met Asn Thr Asn Phe Ala Lys Thr Tyr Ala Gly Ser Leu 
600 605 610 



1603 



ttt aat ccc act get gag cag gag get ccg tta gtt cct aat ctt ctt 1651 
Phe Asn Pro Thr Ala Glu Gin Glu Ala Pro Leu Val Pro Asn Leu Leu 



1699 



ggt act gaa ggt get cct tac gaa aag aga ttt tgg gtt gca ggc att 1747 

Gly Thr Glu Gly Ala Pro Tyr Glu Lys Arg Phe Trp Val Ala Gly He 
535 540 545 

tec aat gtt ttg cat agg age ggt cgt gaa aat caa agg aaa ttc cgt 1795 

Ser Asn Val Leu His Arg Ser Gly Arg Glu Asn Gin Arg Lys Phe Arg 
550 555 560 565 

cat gtg agt gga ggt get gta gta ggt get age acg agg atg ccg ggt 1843 

His Val Ser Gly Gly Ala Val Val Gly Ala Ser Thr Arg Met Pro Gly 
570 575 580 

ggt gat acc ttg tct ctg ggt ttt get cag etc ttt gcg cgt gac aaa 1891 

Gly Asp Thr Leu Ser Leu Gly Phe Ala Gin Leu Phe Ala Arg Asp Lys 
585 590 595 



1939 
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aac ccc aaa tgt acg act acc eta ctt tec aac caa ggg agt tgg aag 
Asn Pro Lys Cys Thr Thr Thr Leu Leu Ser Asn Gin Gly Ser Trp Lys 
775 780 785 



1987 



2035 



2083 



Fig. 1 (con't) 

cgt ttg cag cac gat get tec eta tac tct gtg gtg agt ate ctt tta 

Arg Leu Gin His Asp Ala Ser Leu Tyr Ser Val Val Ser He Leu Leu 
615 620 625 

gga gag gga gga etc cgc gag ate ctg ttg cct tat gtt tec aag act 
Gly Glu Gly Gly Leu Arg Glu He Leu Leu Pro Tyr Val Ser Lys Thr 
630 635 640 645 

ctg ccg tgc tct ttc tat ggg cag ctt age tac ggc cat acg gat cat 
Leu Pro Cys Ser Phe Tyr Gly Gin Leu Ser Tyr Gly His Thr Asp His 
650 655 660 

cgc atg aag acc gag tct eta ccc ccc ccc ccc ccg acg etc teg acg 2131 
Arg Met Lys Thr Glu Ser Leu Pro Pro Pro Pro Pro Thr Leu Ser Thr 
665 670 675 

gat cat act tct tgg gga gga tat gtc tgg get gga gag ctg gga act 217 9 
Asp His Thr Ser Trp Gly Gly Tyr Val Trp Ala Gly Glu Leu Gly Thr 
680 685 690 

cga gtt get gtt gaa aat acc age ggc aga gga ttt ttc caa gag tac 2227 
Arg Val Ala Val Glu Asn Thr Ser Gly Arg Gly Phe Phe Gin Glu Tyr 
695 700 705 

act cca ttt gta aaa gtc caa get gtt tac get cgc caa gat age ttt 2275 
Thr Pro Phe Val Lys Val Gin Ala Val Tyr Ala Arg Gin Asp Ser Phe 
710 715 720 725 

gta gaa eta gga get ate agt cgt gat ttt agt gat teg cat ctt tat 2323 
Val Glu Leu Gly Ala He Ser Arg Asp Phe Ser Asp Ser His Leu Tyr 
730 735 740 

aac ctt gcg att cct ctt gga ate aag tta gag aaa egg ttt gca gag 2371 
Asn Leu Ala He Pro Leu Gly He Lys Leu Glu Lys Arg Phe Ala Glu 
745 750 755 

caa tat tat cat gtt gta gcg atg tat tct cca gat gtt tgt cgt agt 2419 
Gin Tyr Tyr His Val Val Ala Met Tyr Ser Pro Asp Val Cys Arg Ser 
760 765 770 



2467 



acc aaa ggt teg aac tta gca aga cag get ggt att gtt cag gee tea 2515 

Thr Lys Gly Ser Asn Leu Ala Arg Gin Ala Gly He Val Gin Ala Ser 

790 795 800 805 

ggt ttt cga tct ttg gga get gca gca gag ctt ttc ggg aac ttt ggc 2563 

Gly Phe Arg Ser Leu Gly Ala Ala Ala Glu Leu Phe Gly Asn Phe Gly 

810 815 820 
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Fig. 1 (corVt) 

ttt gaa tgg egg gga tct tct cgt age tat aat gta gat gcg ggt age 2611 

Phe Glu Trp Arg Gly Ser Ser Arg Ser Tyr Asn Val Asp Ala Gly Ser 
825 830 835 

aaa ate aaa ttt tagegattte tctttcgatg ctatttttcc atggctattt 2663 
Lys lie Lys Phe 
840 

ttaaaatgat agccatggtt atagatacgt agtccttatt tcaaagaaga cactgttgca 2723 
ttagataege tctctgatcc ctcaaaa 2750 
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Figure 2 (RY-32) 

Restriction Enzyme analysis of CPN 100397 

Mae I I Maelll Bael 

Msel | Tsp45I Ddel | MboII 

II III I 

ATTTTAACGTGCGTATCATTTGTGACTAAGAGATAGACTTGCTTTCTTTATCTATCTTCT 

X + + + + + + 60 

TAAAATTGCACGCATAGTAAACACTGATTCTCTATCTGAACGAAAGAAATAGATAGAAGA 

BslI 
Smll | 

CviJI | | Hinfl Acil 

Mnll | | Bce83I Tfil MboII 

III II I 

GTATTGGAAAGAAAGCCCCTTGAGGGAAAAAAAGGTTGTTATGAAGATTCCACTCCGCTT 

61 + + + + + + 120 

CATAACCTTTCTTTCGGGGAACTCCCTTTTTTTCCAACAATACTTCTAAGGTGAGGCGAA 

Apol 

Bbvl Fnu4HI 

Tsp509I Alul| 

AceIIl| CviJI | 
EcoRV Rsal Taql|| Tsel | Earl 

I I III II I 

TTTATTGATATCATTAGTACCTACGCTTTCTATGTCGAATTTATTAGGAGCTGCTACTAC 

12 1 + + + + + + 180 

AAATAACTATAGTAATCATGGATGCGAAAGATACAGCTTAAATAATCCTCGACGATGATG 



Cac8I 
Bf al 
CviJI | 
MboII | 
Bcgl Nhelj 
I I 



Taql Alul 

Alul | CviJI 

CviJI | Bed Bcgl Hindlll | Bfal 

ill i mi 



CGAAGAGTTATCGGCTAGCAATAGCTTCGATGGAACTACATCAACAACAAGCTTTTCTAG 

+ + + + + + 

GCTTCTCAATAGCCGATCGTTATCGAAGCTACCTTGATGTAGTTGTTGTTCGAAAAGATC 



Tsp509I 

NlalV | Hinfl 

Sfcl BanI | | Dral | Sf cl 

CviJI] 'Bed | I | Msel|TfiI | 

II Mil M I I 

TAAAACATCATCGGCTACAGATGGCACCAATTATGTTTTTAAAGATTCTGTAGTTATAGA 

241 + + + + + + 300 

ATTTTGTAGTAGCCGATGTCTACCGTGGTTAATACAAAAATTTCTAAGACATCAATATCT 



6/165 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



PCT/CA99/00992 



2 (con't) 



Rsal 



BslI 
BslI I 



Dral 
Msel 
Bbvl 
BseMII 
Bfal | 
Spel| | 
Ddel AccI i I I 



I II 



Alul 
CviJI 
Hgal 
MspAlI 
PvuII 
PstI 
Fnu4HI 
CviRI 
Tsel 
Fnu4HI | 
Sf cl | 
Tsel | | 



AAATGTACCCAAAACAGGGGAAACTCAGTCTACTAGTTGTTTTAAAAATGACGCTGCAGC 

3d + + + + ---- + + 360 

TTTACATGGGTTTTGTCCCCTTTGAGTCAGATGATCAACAAAATTTTTACTGCGACGTCG 



Apol 
Tsp509I 
Bbvl 
Dpnl 
Bglll | 
BstYI | 
Sau3AI 



Ddel 
Mill I | Bpml 
I I 



Acelll 
BsaJI 
BstDSI 
CviRI 
Cjel | 
Clal| | 
Taql | | 
I I I 



Sf aNI 

I I 
TGGAGATCTAAATTTCTTAGGAGGGGGATTTTCTTTCACATTTAGCAATATCGATGCAAC 

361 + + + - — + + + 420 

ACCTCTAGATTTAAAGAATCCTCCCCCTAAAAGAAAGTGTAAATCGTTATAGCTACGTTG 



Beef I 
Fnu4HI 

Alul 
CviJI 
Tsel 
Mwol 
Hpyl78III 
CviJI | 
Mwol | j 
Bbvl | j j 

Ml I 



Alul 

CviJI 

Fnu4HI 

Tsel | 

Bpml | j 

Cjel Ml. 

I I II I II 
CACGGCTTCTGGAGCTGCTATTGGAAGTGAAGCAGCTAATAAGACAGTCACGTTATCAGG 

421 + + + + + + 480 

GTGCCGAAGACCTCGACGATAACCTTCACTTCGTCGATTATTCTGTCAGTGCAATAGTCC 



Hpyl78III 
Maell 
Maelll 
Taal 
Tsp45I 
Bbvl I 



Maelll 
Taal 
Tsp45I 
Rsal | 
TatI | |TspRI 



Alul 
CviJI 



481 



Msel 

I Mil I 

ATTTTCGGCACTTTCTTTTCTTAAATCCCCAGCAAGTACAGTGACTAATGGATTGGGAGC 

+ + + + + + 

TAAAAGCCGTGAAAGAAAAGAATTTAGGGGTCGTTCATGTCACTGATTACCTAACCCTCG 



540 
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2 (con't) 



Apol 
Tsp509I CviJI 
Msel |MseI | 

I 



Hin4I 
Cjel 
Tsp509I 
Hpyl78III | 
Hinfl | I 

Tfil I j 

I I I 



TATCAATGTTAAAGGGAATTTAAGCCTATTGGATAATGATAAGGTATTGATTCAGGACAA 



541 



-+ 600 



ATAGTTACAATTTCCCTTAAATTCGGATAACCTATTACTATTCCATAACTAAGTCCTGTT 



Tsp509I 
Msel 
Vspl 
Tsp509I 
Ecil | 
Acil| j 
Bed | j j 



Nlaiv 
CviJI | 
Cac8I | 
CviRI | | 
|CjeI | | | 
I I I I 

TTTCTCAACAGGAGATGGCGGAGCAATTAATTGTGCAGGCTCCTTGAAGATCGCAAACAA 

601 + + + + + + 660 

AAAGAGTTGTCCTCTACCGCCTCGTTAATTAACACGTCCGAGGAACTTCTAGCGTTTGTT 



Dpnl 
BsmFl| MboII 
Sau3AI | |BsgI | 
III I I 



BsaAI 
Pmll 
Maell | 

XmnI Afllll | j Ecil Hinfl 
Tthlllll MboII | Bsbl ||Acil| Tfil 

I II I II II I 

TAAGTCCCTTTCTTTTATTGGAAATAGTTCTTCAACACGTGGCGGAGCGATTCATACCAA 



661 



■+ 720 



ATTCAGGGAAAGAAAATAACCTTTATCAAGAAGTTGTGCACCGCCTCGCTAAGTATGGTT 



Fnu4HI 
CviJI | 
Tsel j 

MWOl | \ 

Bbvl Haell | | 

MboII Mnll Cjel Hhal | j j 

II I ll I I 

AAACCTCACACTATCTTCTGGTGGGGAAACTCTATTTCAGGGGAATACAGCGCCTACGGC 

721 + + + + + + 780 

TTTGGAGTGTGATAGAAGACCACCCCTTTGAGATAAAGTCCCCTTATGTCGCGGATGCCG 
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Fig. 2 (con't) 



Nlaiv 
BanI 
AlwNI 
Hinfl 
CviRI 



Mnll 



Plel 
Hpyl78III 
Nrul 

Hin4I Thai 
Bcefl |CjeI | 
I I I 



Hpyl78III 
BsmAl | 
I I 



TGCTGGTAAAGGAGGTGCTATCGCGATTGCAGACTCTGGCACCCTATCCATTTCTGGAGA 

781 + + + + + + 840 

ACGACCATTTCCTCCACGATAGCGCTAACGTCTGAGACCGTGGGATAGGTAAAGACCTCT 



TspRI 



Taal 



Sf cl 
Alul | 
Cvi JI j 



Ppil 
Taal | BsmAI 

Bpml CviJI| NlalV ||BsmBI 

ill ii ill i 

CAGTGGCGACATTATCTTTGAAGGCAATACGATAGGAGCTACAGGAACCGTCTCTCATAG 

841 + + + + + + 900 

GTCACCGCTGTAATAGAAACTTCCGTTATGCTATCCTCGATGTCCTTGGCAGAGAGTATC 

Hhal 
Fspl 
Fnu4HI 
Tsel 
BsaAI 
Mwol 
Maell | 
Maelll | j 
Bbvl | j j 



Ddel 
Haell 
Hhal | 
Eco47III | | 
Bf al | | | 
I III 



1 



TGCTATTGATTTAGGAACTAGCGCTAAGATAACTGCGTTACGTGCTGCGCAAGGACATAC 

901 + + + + + + 960 

ACGATAACTAAATCCTTGATCGCGATTCTATTGACGCAATGCACGACGCGTTCCTGTATG 



Hpyl88IX 
Dpnl | 
Sau3AI | | 
Alwl | | j 

I 



Alwl 
SfaNI 
BsaBI 
Taql | 

Maelll Dpnl | j 

Taal Sau3AI | | j 
I II 



GATATACTTTTATGATCCGATTACTGTAACAGGATCGACATCTGTTGCTGATGCTCTCAA 

961 + + + + + + 1020 

CTATATGAAAATACTAGGCTAATGACATTGTCCTAGCTGTAGACAACGACTACGAGAGTT 
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Fig. 2 (con't) 



Msel 
Vspl 
Sspl I CviJI 
I I 



Drdll 
NlalV 
BscGI 
Bpml | 
BstZ17I | | 
BslI BsrI Sthl32I AccI | || 
II I II II 



Hpyl78III 
I 



1021 



TATTAATAGCCCTGATACTGGAGATAACAAAGAGTATACGGGAACCATAGTCTTTTCTGG 

+ + (. + + + 

ATAATTATCGGGACTATGACCTCTATTGTTTCTCATATGCCCTTGGTATCAGAAAAGACC 



1080 



Alul 
CviJI 
Mnll 
Hin4I | 
I 



Alul 
BplI 
CviJI 
Bpml | 
I 



Tsp509I 
Acil MboII | 
I I I 



AGAGAAGCTCACGGAGGCAGAAGCTAAAGATGAGAAGAACCGCACTTCTAAATTACTTCA 



1081 



-+ 1140 



TCTCTTCGAGTGCCTCCGTCTTCGATTTCTACTCTTCTTGGCGTGAAGATTTAATGAAGT 



1141 



BsmFI 

Dral Taal Dral | HphI 

Msel | Sfcl| Msel | | Msel 

II II III I 

AAATGTTGCTTTTAAAAATGGGACTGTAGTTTTAAAAGGTGATGTCGTTTTAAGTGCGAA 

+ + + + + + 

TTTACAACGAAAATTTTTACCCTGACATCAAAATTTTCCACTACAGCAAAATTCACGCTT 



1200 



Hpyl78III 

SfaNI 

Ppil| 
Taal | | 
Xmnl | | Ddel 

II I 



Fokl 
Ddel 
BseMII | 
Hin4I | | 
CviRI | | j 

II II 



Aatll 
BsaHI | 
Maell 



CviRI 
BsmFI I 



I I 



CGGTTTCTCTCAGGATGCAAACTCTAAGTTGATTATGGATTTAGGGACGTCGTTGGTTGC 

120 1 + + + + + + 1260 

GCCAAAGAGAGTCCTACGTTTGAGATTCAACTAATACCTAAATCCCTGCAGCAACCAACG 



Apol 

Tsp509I Plel Hpyl78III 

Hindi | Msel | Ddel | 

Tthlllll Hpal j Vspl j Sthl32I| j 

Taql|Msel| | Tsp509I | j Hinfl || j BscGI 

II II I III I M I I 
AAACACCGAAAGTATCGAGTTAACGAATTTGGAAATTAATATAGACTCTCTCAGGAACGG 



1261 



-+ 1320 



TTTGTGGCTTTCATAGCTCAATTGCTTAAACCTTTAATTATATCTGAGAGAGTCCTTGCC 
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Fig. 2 (con't) 



Bbvl 
BseMII | 



Hpyl8 8IX 
Ddel 
Alul | 
TspRI Cvi JI | 
Fnu4HI | BseMII | | 
Ddel Tsel | |MwoI | j j 
I Ml 



Dpnl 
Sau3AI | 
BseMII | j 

Acelll | I | 

I i M 



GAAAAAGATAAAACTCAGTGCTGCCACAGCTCAGAAAGATATTCGTATAGATCGTCCTGT 

1321 + + + + + 

CTTTTTCTATTTTGAGTCACGACGGTGTCGAGTCTTTCTATAAGCATATCTAGCAGGACA 



+ 1380 



BsrI 
Tsp509I 
Rsal | 
Tat I | Hin4I 

ill I >i il 

TGTACTGGCAATTAGCGATGAGAGTTTTTATCAAAATGGCTTTTTGAATGAGGACCATTC 



Mnll 
CviJI | 



Avail 
Sau96I 



MslI 



1381 



ACATGACCGTTAATCGCTACTCTCAAAAATAGTTTTACCGAAAAACTTACTCCTGGTAAG 



-+ 1440 



Hpyl7 8III 
Smll 
SfaNI 
Hinf I 
Tfil 
Bed | 
Bsllj j 
Xcml | j | 



Hpyl78III 
Bce83I I 



Hinf I 
Tfil 
PstI | 
CviRI | | 
Sfcl | j j SfaNI 

I Ml I 



CTATGATGGGATTCTTGAGTTAGATGCTGGGAAAGACATCGTGATTTCTGCAGATTCTCG 



1441 + + + + 

GATACTACCCTAAGAACTCAATCTACGACCCTTTCTGTAGCACTAAAGACGTCTAAGAGC 



+ 1500 



Muni 

Rsal Tsp509I 
BsrGI | D P nI I 

TatI CviJI Sau3AI | | AccI 

M i I I I I 

CAGTATAGATGCTGTACAATCTCCGTATGGCTATCAGGGAAAGTGGACGATCAATTGGTC 



1501 



1560 



GTCATATCTACGACATGTTAGAGGCATACCGATAGTCCCTTTCACCTGCTAGTTAACCAG 

Mnll 
TspRI 

BseMII Bpull02l[ 
Msel | Ddel | 

Hin4I | j BtsI | | 



Alul 
CviJI Taal 



I 



1561 



TACTGATGATAAGAAAGCTACGGTTTCTTGGGCGAAGCAGAGTTTTAATCCCACTGCTGA 

h + + + + + 

ATGACTACTATTCTTTCGATGCCAAAGAACCCGCTTCGTCTCAAAATTAGGGTGACGACT 



1620 
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Fig. 2 (con't) 



BsaXI 
NlalV 
Cvi JI | MboII 
I I I 

GCAGGAGGCTCCGTTAGTTCCTAATCTTCTTTGGGGTTCTTTTATAGATGTTCGTTCCTT 



1621 



CGTCCTCCGAGGCAATCAAGGATTAGAAGAAACCCCAAGAAAATATCTACAAGCAAGGAA 



■+ 1680 



Apol Bfal 
Tsp509I Alul | BsiHKAI 

Hpyl78III | Cvi JI I Rsal Bspl286I Eco57I CviRI 

II III I I I 

CCAGAATTTTATAGAGCTAGGTACTGAAGGTGCTCCTTACGAAAAGAGATTTTGGGTTGC 

1681 + + + + + + 1740 

GGTCTTAAAATATCTCGATCCATGACTTCCACGAGGAATGCTTTTCTCTAAAACCCAACG 



Hpyl78III 
BsiEI | 

BsaXI | j MslI 
Acil Ml" Apol Mnll| 

Cac8I CviRI BsrBI | | j Tsp509I Nlalll j 

| I I I I I I M 

AGGCATTTCCAATGTTTTGCATAGGAGCGGTCGTGAAAATCAAAGGAAATTCCGTCATGT 

1741 + + + + + + 1800 

TCCGTAAAGGTTACAAAACGTATCCTCGCCAGCACTTTTAGTTTCCTTTAAGGCAGTACA 



Sf Cl 
BsaXI | 
I I 



MslI 
Sthl32I 
BssSI 
Cac8I 
CjePI 
Bfal 
Mill I 
SfaNI 
Nhel | 
II 



Neil HaelV BsmAI 

ScrFI Hin4I CjePI | 

Mspl| Fokl | HphI | | 

II I I III 



GAGTGGAGGTGCTGTAGTAGGTGCTAGCACGAGGATGCCGGGTGGTGATACCTTGTCTCT 

1801 + + + + + + I860 

CTCACCTCCACGACATCATCCACGATCGTGCTCCTACGGCCCACCACTATGGAACAGAGA 



Alul 
CviJI 
Bpull02I | 
Ddel 



Acelll 
BseMII 
Maelll 
Tsp45I 
Hhal | 
Thai j 
Mwol | | 
I II 



Tsp509I 



GGGTTTTGCTCAGCTCTTTGCGCGTGACAAAGACTACTTTATGAATACCAATTTCGCAAA 

1861 + + + + + + 1920 

CCCAAAACGAGTCGAGAAACGCGCACTGTTTCTGATGAAATACTTATGGTTAAAGCGTTT 
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Fig. 2 (con't) 



Fnu4HI 
CviRI | 
Tsel | 
Dpnl SfaNI | 
BstYI | Alwl | | 
Sau3AI | Maell | | I Bbvl 
II 111 

GACCTACGCAGGATCTTTACGTTTGCAGCACGATGCTTCCCTATACTCTGTGGTGAGTAT 



1921 



■+ 1980 



CTGGATGCGTCCTAGAAATGCAAACGTCGTGCTACGAAGGGATATGAGACACCACTCATA 



Dpnl 
BstYI 
Sau3AI 
BciVI Thai 
Mnll Acil | 

HphI | Alwl | 

Mnll |PleI Hinfl | | 

I 



Plel 
Bcefl | Hinfl 
I I I 



1981 



CCTTTTAGGAGAGGGAGGACTCCGCGAGATCCTGTTGCCTTATGTTTCCAAGACTCTGCC 

+ + (- h h + 

GGAAAATCCTCTCCCTCCTGAGGCGCTCTAGGACAACGGAATACAAAGGTTCTGAGACGG 



2040 



BsiHKAI 
Bspl286I 



CviJI 
Haelll 
Bbvl 
Eael 
Gdill 
Alul 



CviJI 
Bpull02I 
Ddel 
Alul | 
CviJI | 
Fnu4HI | | 
Tsel | || 
I 



Bbsl 
Hin4I 
Hinfl 
Tthllll 
Nlalll 
Alwl 



Bcefl | 
Dpnl | | 
Sau3AI | | | 
II II 



GTGCTCTTTCTATGGGCAGCTTAGCTACGGCCATACGGATCATCGCATGAAGACCGAGTC 



2041 



CACGAGAAAGATACCCGTCGAATCGATGCCGGTATGCCTAGTAGCGTACTTCTGGCTCAG 



-+ 2100 



Taqll 
Plel | 
BsmAI | j 
MboII | | | 
II 



Dpnl 
Sau3AI 
Hgal 
Hpyl78III | 
Taql | 
Sthl32I | | 
I I I 



Mnll 
Alwl | 
I I 



Acelll 
BsaXI | 
Hin4I | 



TCTACCCCCCCCCCCCCCGACGCTCTCGACGGATCATACTTCTTGGGGAGGATATGTCTG 

2101 + + + + + + 2160 

AGATGGGGGGGGGGGGGGCTGCGAGAGCTGCCTAGTATGAAGAACCCCTCCTATACAGAC 
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. 2 (con't) 

Fnu4HI 

TaqI Taul 
Aval] Acil| 
Alul Smllj MspAllj 
CviJI CviJI Xholj Bpml Mnll | | 

i i M i mi 

GGCTGGAGAGCTGGGAACTCGAGTTGCTGTTGAAAATACCAGCGGCAGAGGATTTTTCCA 

2161 + + + + + + 2220 

CCGACCTCTCGACCCTTGAGCTCAACGACAACTTTTATGGTCGCCGTCTCCTAAAAAGGT 

Alul 

Rsal Alul Cac8I CviJI 

TatI I CviJI Mwol | Mwol | 

M i i I i I 

AGAGTACACTCCATTTGTAAAAGTCCAAGCTGTTTACGCTCGCCAAGATAGCTTTGTAGA 

2221 + + + + + + 2280 

TCTCATGTGAGGTAAACATTTTCAGGTTCGACAAATGCGAGCGGTTCTATCGAAACATCT 

BsaBI 

Alul Hinfl | CjePI Hinfl 

Bfal CviJI Hpyl78III Tfil | SfaNI Tfil 

II I II II 

ACTAGGAGCTATCAGTCGTGATTTTAGTGATTCGCATCTTTATAACCTTGCGATTCCTCT 

2281 + + + + + + 2340 

TGATCCTCGATAGTCAGCACTAAAATCACTAAGCGTAGAAATATTGGAACGCTAAGGAGA 

Mnll 

Hinfl | CviRI Bpml Tthlllll 

Tfil CjePI Taal | Sspl Nlalll | Hin4I | 

ii ill i ii M 

TGGAATCAAGTTAGAGAAACGGTTTGCAGAGCAATATTATCATGTTGTAGCGATGTATTC 

2341 + + + + + + 2400 

ACCTTAGTTCAATCTCTTTGCCAAACGTCTCGTTATAATAGTACAACATCGCTACATAAG 

BslI 

Maelll BsaJI | 

Hpyl78III Pflll08I | Rsal Mmel Styl | 

i ii i i u 

TCCAGATGTTTGTCGTAGTAACCCCAAATGTACGACTACCCTACTTTCCAACCAAGGGAG 

2401 + + + + + + 2460 

AGGTCTACAAACAGCATCATTGGGGTTTACATGCTGATGGGATGAAAGGTTGGTTCCCTC 

BSU36I 

NspV Ddel 
TaqI CviJI 
MboII | Hael 
Mmel | j CviJI Hael II 

Bbsl| || Ddel Mwol | StuI | Acelll 

II II I II M I 

TTGGAAGACCAAAGGTTCGAACTTAGCAAGACAGGCTGGTATTGTTCAGGCCTCAGGTTT 

2461 + + + + + + 2520 

AACCTTCTGGTTTCCAAGCTTGAATCGTTCTGTCCGACCATAACAAGTCCGGAGTCCAAA 
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2 (con't) 



BseMII 
Dpnl 
Sau3AI 
Bbvl | 
Mnll j 
Taql| 
I I 



Hpyl78III 
Alul 
CviJI 
Sthl32I 



PstI 
Fnu4HI 
CviRI 
Tsel 
Fnu4HI 
Sf cl 
Alul | 
CviJI | 
Tsel 



Bbvl 



Dpnl 
BstYI 

Faul Sau3AI 
Sthl32l| Acil | 
CviJI | j MboII | | 

I II III 



TCGATCTTTGGGAGCTGCAGCAGAGCTTTTCGGGAACTTTGGCTTTGAATGGCGGGGATC 

2521 + + + + + + 2580 

AGCTAGAAACCCTCGACGTCGTCTCGAAAAGCCCTTGAAACCGAAACTTACCGCCCCTAG 



Faul 
Sthl32I 
Sf aNI 
Alul 



CviJI 
Pflll08I | 
Alwl I 



Acil 
I 



Apol 
Tsp509I 



SfaNI 



TaqI 



TTCTCGTAGCTATAATGTAGATGCGGGTAGCAAAATCAAATTTTAGCGATTTCTCTTTCG 

2581 + + + + + + 2640 

AAGAGCATCGATATTACATCTACGCCCATCGTTTTAGTTTAAAATCGCTAAAGAGAAAGC 



CviJI 
Nlalll 
BsaJI 
BstDSI 
Ncol 
Styl 



Dral 
Msel | 
I I 



Nlalll 
BsaJI 
BstDSI 
Ncol 
Styl 
CviJI I 



BsaAI 
HaelV 
Hin4I 
SnaBI 
Maell | 

I I 



ATGCTATTTTTCCATGGCTATTTTTAAAATGATAGCCATGGTTATAGATACGTAGTCCTT 

2641 + + + + + + 2700 

TACGATAAAAAGGTAC CGATAAAAATTTTACTATCGGT AC CAATATCTATG C ATC AGGAA 



CviRI 
MboII 
TspRl| 
Taal | | 
Bbsl | j j 
II II 



Dpnl 
Sau3AI 
Hpyl8 8IX| 
Alwl | j 
Hin4I | | j 
I I II 



ATTTCAAAGAAGACACTGTTGCATTAGATACGCTCTCTGATCCCTCAAAA 

2701 + + + + + 2750 

TAAAGTTTCTTCTGTGACAACGTAATCTATGCGAGAGACTAGGGAGTTTT 
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Figure 3: CPN1 00421 

ctcctgtccc tcgcgttgtc aacctacccc tcctccctcg aattctaatc ctttgaacgt 60 



agtacaacag cctgttgctg catcgtcagt gccttcctac atg ccc cca ctg aat 

Met Pro Pro Leu Asn 
1 5 



115 



get gat gat gtt etc cct aga gac cat ctg tea gat gga agt ttc tea 163 
Ala Asp Asp Val Leu Pro Arg Asp His Leu Ser Asp Gly Ser Phe Ser 
10 15 20 



gat acg tat cca gac att aca acg caa gcg ate ate tta att ttc ttg 
Asp Thr Tyr Pro Asp lie Thr Thr Gin Ala lie lie Leu lie Phe Leu 
25 30 35 

gee eta teg cct ttc ctg gtc atg ttg etc act teg tat eta aag att 
Ala Leu Ser Pro Phe Leu Val Met Leu Leu Thr Ser Tyr Leu Lys lie 
40 45 50 

ate att act tta gtc tta tta cgt aac gec tta gga gta caa caa aca 
lie lie Thr Leu Val Leu Leu Arg Asn Ala Leu Gly Val Gin Gin Thr 
55 60 65 

cct ccc agt caa gtc etc aat ggg att gca etc ate eta tct att tat 
Pro Pro Ser Gin Val Leu Asn Gly lie Ala Leu lie Leu Ser lie Tyr 
70 75 80 85 

gtg atg ttc ccc acg gga gtg get atg tat aaa gat get cgc aag gaa 
Val Met Phe Pro Thr Gly Val Ala Met Tyr Lys Asp Ala Arg Lys Glu 
90 95 100 

ate gaa gee aat ace att cct caa age etc ttc act gca gaa ggt get 
He Glu Ala Asn Thr He Pro Gin Ser Leu Phe Thr Ala Glu Gly Ala 
105 HO 115 

gaa aca gtg ttt gtc get tta aac aaa tct aaa gaa cct ttg cgc tct 
Glu Thr Val Phe Val Ala Leu Asn Lys Ser Lys Glu Pro Leu Arg Ser 
120 125 130 

ttc tta att cgc aac act cca aaa gca caa att caa age ttt tac aag 
Phe Leu He Arg Asn Thr Pro Lys Ala Gin He Gin Ser Phe Tyr Lys 
135 140 145 

ate tea cag aaa ace ttc cct teg gaa att cga gcg cac etc act gee 
He Ser Gin Lys Thr Phe Pro Ser Glu He Arg Ala His Leu Thr Ala 
150 155 160 165 

tec gac ttt gta ate att att cct get ttt att atg ggt cag ata aaa 
Ser Asp Phe Val He He He Pro Ala Phe He Met Gly Gin He Lys 
170 175 180 

aat get ttc gaa att gga gtc ttg ate tat eta cct ttc ttt gtt att 
Asn Ala Phe Glu He Gly Val Leu He Tyr Leu Pro Phe Phe Val He 
185 190 195 



211 



259 



307 



355 



403 



451 



499 



547 



595 



643 



691 



16/165 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



PCT/CA99/00992 



Fig. 3 (con't) 

gat tta gtg act get aac gtt ctt gta gcg atg cag atg atg atg tta 73 9 

Asp Leu Val Thr Ala Asn Val Leu Val Ala Met Gin Met Met Met Leu 

200 205 210 

tec cct eta teg att teg tta cct tta aag tta ctt ttg ate gtc atg 787 
Ser Pro Leu Ser lie Ser Leu Pro Leu Lys Leu Leu Leu lie Val Met 
215 220 225 

gta gac gga tgg aca tta ctg etc caa ggg ctt atg ate age ttt aaa 835 
Val Asp Gly Trp Thr Leu Leu Leu Gin Gly Leu Met lie Ser Phe Lys 
230 235 240 245 

taaggacacg tgccgtgtta gcatttttcg caactagttt caaatctgtt ctttttgagt 895 

actcctacca atcattatta cttattttga ttgtttcggc acctcccatc atcttagctt 955 

ecatagtegg gattatggtt gcgatcttcc aagccgcaac acaaa 1000 



17/165 

SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



PCT/CA99/00992 



Mnll Maell 



Figure 4 (RY-34) 

Restriction enzyme analysis of CP 100421 

Mnll 
Apol 
EcoRI 

BseRI Tsp509I 
Hindi | Mnll | 

Thai Mnll |! BsaXI TaqI | j 

I III I Ml 

CTCCTGTCCCTCGCGTTGTCAACCTACCCCTCCTCCCTCGAATTCTAATCCTTTGAACGT 

! + + + + + + 60 

GAGGACAGGGAGCGCAACAGTTGGATGGGGAGGAGGGAGCTTAAGATTAGGAAACTTGCA 

Bbvl CviRI 
Rsal| Fnu4HI | TspRI Nlalll BsmI 

TatI I CviJI Tsel| [ SfaNI | Nspl TspRI | 

Mil III I I I II 

AGTACAACAGCCTGTTGCTGCATCGTCAGTGCCTTCCTACATGCCCCCACTGAATGCTGA 

61 + + + + + + 120 

TCATGTTGTCGGACAACGACGTAGCAGTCACGGAAGGATGTACGGGGGTGACTTACGACT 

BseMII 

Hpyl88IX Hpyl78III 
Ahdl | BsaAI 
Bfal HaeIV| j BsaBI 
Bsal | Hin4l| j Hpyl88IX SnaBI 

BsmAI j Bed | |Xcml |Bcd Ddel |MaeII| | BciVI 

II III II I I I II I I 

TGATGTTCT C C CTAG AG AC CATCTGTCAG ATGGAAGTTTCTCAGATACGTAT C CAG AC AT 

121 + + + + + + 180 

ACTACAAGAGGGATCTCTGGTAGACAGTCTACCTTCAAAGAGTCTATGCATAGGTCTGTA 

Dpnl CviJI 
Sau3AI |Tsp509I Haelll ScrFI 

Cac8I | j Msel| Sau96I | EcoRII | Nlalll 

I I I II II II I 

TACAACGCAAGCGATCATCTTAATTTTCTTGGCCCTATCGCCTTTCCTGGTCATGTTGCT 

181 + + + + + + 240 

ATGTTGCGTTCGCTAGTAGAATTAAAAGAACCGGGATAGCGGAAAGGACCAGTACAACGA 

BsaAI 
Maelll 

SnaBI Bsu36I Rsal 
Maell| Ddel TatI | 

II III 
CACTTCGTATCTAAAGATTATCATTACTTTAGTCTTATTACGTAACGCCTTAGGAGTACA 

241 + + + + + + 300 

GTGAAGCATAGATTTCTAATAGTAATGAAATCAGAATAATGCATTGCGGAATCCTCATGT 
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Fig. 4 (con't) 



Mnll 
Tthlllll 
Bmrl | CviRI 
Bsrl| | Fokl Mnll | Sthl32I 

II I I I I I 

ACAAACACCTCCCAGTCAAGTCCTCAATGGGATTGCACTCATCCTATCTATTTATGTGAT 

301 + + + + + + 360 

TGTTTGTGGAGGGTCAGTTCAGGAGTTACCCTAACGTGAGTAGGATAGATAAATACACTA 



SfaNI 
CviJI 
BstXI 
MslI 



BscGI | 
BsaJI | I 
BstDSI I | 
II 



TaqI 

Bcgl Cac8I CjePI | CviJI Bcgl 
I I II I I 



361 



GTTCCCCACGGGAGTGGCTATGTATAAAGATGCTCGCAAGGAAATCGAAGCCAATACCAT 

+ + + + + + 

CAAGGGGTGCCCTCACCGATACATATTTCTACGAGCGTTCCTTTAGCTTCGGTTATGGTA 



420 



Tthlllll 
BstAPI 
Pstl 
TspRI 
CviRI 
Mnll 
BtsI 
Sf CI 
Earl 
CjePI 
Mnll [ 
CviJI | I 
MboII | | I I I I iMwoI 

! Ill 

TCCTCAAAGCCTCTTCACTGCAGAAGGTGCTGAAACAGTGTTTGTCGCTTTAAACAAATC 



TspRI 
Taal [ 
I 



Dral 
Msel I 



421 



-+ 480 



AGGAGTTTCGGAGAAGTGACGTCTTCCACGACTTTGTCACAAACAGCGAAATTTGTTTAG 



Alul 
CviJI 
Hindi I I | 
Apol | | 

Bsbl Tsp509I | | 

III I III 

TAAAGAACCTTTGCGCTCTTTCTTAATTCGCAACACTCCAAAAGCACAAATTCAAAGCTT 



Tsp509I 
Hhal Msel I 



481 



-+ 540 



ATTTCTTGGAAACGCGAGAAAGAATTAAGCGTTGTGAGGTTTTCGTGTTTAAGTTTCGAA 
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4 (con't) 



Dpnl 
Bglll | 
BstYI j 
Sau3AI j 



Apol 

Tsp509I Hhal 
XmnI Hpyl88IX j TaqI Hin4I 
I I 



Hpyl8 8IX 
Mnll | 
TspRI | | 
Btsl | | | 
I Ml 



TTACAAGATCTCACAGAAAACCTTCCCTTCGGAAATTCGAGCGCACCTCACTGCCTCCGA 



541 



■+ 600 



AATGTTCTAGAGTGTCTTTTGGAAGGGAAGCCTTTAAGCTCGCGTGGAGTGACGGAGGCT 

Tsp509I 

Mnll Hpyl8 8IX NspV | 

BplI I Mmel SimI | TaqI j Hinfl 

II I II III 

CTTTGTAATCATTATTCCTGCTTTTATTATGGGTCAGATAAAAAATGCTTTCGAAATTGG 



601 



-+ 660 



GAAACATTAGTAATAAGGACGAAAATAATACCCAGTCTATTTTTTACGAAAGCTTTAACC 



Dpnl 
Plel 
Sau3AI | 

Hpyl78IIl| j Maelll 
Hin4I I Tsp45I 

mi I ill 

AGTCTTGATCTATCTACCTTTCTTTGTTATTGATTTAGTGACTGCTAACGTTCTTGTAGC 

661 + + + + + + 720 

TCAGAACTAGATAGATGGAAAGAAACAATAACTAAATCACTGACGATTGCAAGAACATCG 



Sf aNI 
Acll | 
Mae 1 1 



Maelll 

Mnll | Maelll 

Clal | | Dral | Dpnl 

CviRI TaqI j j Msel | j Sau3AI | 

| III III M 

GATGCAGATGATGATGTTATCCCCTCTATCGATTTCGTTACCTTTAAAGTTACTTTTGAT 

721 + + + + + + 780 

CTACGTCTACTACTACAATAGGGGAGATAGCTAAAGCAATGGAAATTTCAATGAAAACTA 

Beef I 
Dral 
Msel 
Alul 
CviJI 
Dpnl 
Bell 
Sau3AI 
CviJI 
Mwol 
BsaJI | 
Styl | 

Bed Fokl | | 



Ace I 
Nlalll | 



781 



ll I III 

CGTCATGGTAGACGGATGGACATTACTGCTCCAAGGGCTTATGATCAGCTTTAAATAAGG 

+ + + + + + 840 

GCAGTACCATCTGCCTACCTGTAATGACGAGGTTCCCGAATACTAGTCGAAATTTATTCC 
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4 (con't) 

BsaAI 

Pmll RsaI 

Maell| Bfal Seal 

Afllll I Mwol Mwol Spel| TatI | 

Ml I I II M 

ACACGTGCCGTGTTAGCATTTTTCGCAACTAGTTTCAAATCTGTTCTTTTTGAGTACTCC 

841 + + + + + + 900 

TGTGCACGGCACAATCGTAAAAAGCGTTGATCAAAGTTTAGACAAGAAAAACTCATGAGG 

Sthl32I 

Alul 
CviJI 

NlalV Ddel | 

Banl | Bed Mnllj | 

II I II I 

TACCAATCATTATTACTTATTTTGATTGTTTCGGCACCTCCCATCATCTTAGCTTCCATA 

901 + + + + + + 960 

ATGGTTAGTAATAATGAATAAAACTAACAAAGCCGTGGAGGGTAGTAGAATCGAAGGTAT 

Acil 

Dpnl Fnu4HI 

Hpyl78III Sau3AI | Taul 

BslI | MboII | I CviJI) Bsbl 

M IM II I 

GTCGGGATTATGGTTGCGATCTTCCAAGCCGCAACACAAA 

961 + + + + 1000 

CAGCCCTAATACCAACGCTAGAAGGTTCGGCGTTGTGTTT 



21/165 

SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



PCT/CA99/00992 



Figure 5: 

tagctttata caaagtatag aaaaataaca cgacaataaa aggagcggtg ttttctcttc 60 

tgaggtaaat cagcctcaaa gatactacgc catagtaaag atg aag ttt ttt age 115 

Met Lys Phe Phe Ser 
1 5 

tta att ttt aaa gat gat gat gtc tec cca aat aag aag gtt tta tct 163 

Leu lie Phe Lys Asp Asp Asp Val Ser Pro Asn Lys Lys Val Leu Ser 

10 15 20 

cct gaa get ttc tct get ttc ctt gat gee aaa gag ctg tta gaa aaa 211 

Pro Glu Ala Phe Ser Ala Phe Leu Asp Ala Lys Glu Leu Leu Glu Lys 
25 30 35 



aca aaa gee gat age gaa gee tat gtt gca gag aca gaa caa aag tgt 
Thr Lys Ala Asp Ser Glu Ala Tyr Val Ala Glu Thr Glu Gin Lys Cys 
40 45 50 

gca caa att cgt caa gaa get aaa gat caa gga ttt aaa gag gga tct 
Ala Gin lie Arg Gin Glu Ala Lys Asp Gin Gly Phe Lys Glu Gly Ser 
55 60 65 

gaa tec tgg age aag caa att get ttc tta gaa gaa gaa act aaa aat 
Glu Ser Trp Ser Lys Gin He Ala Phe Leu Glu Glu Glu Thr Lys Asn 
70 75 80 85 

eta cgc ata aga gta cgc gag gee ttg gtt cct ctg gca att gcg agt 
Leu Arg He Arg Val Arg Glu Ala Leu Val Pro Leu Ala He Ala Ser 
90 95 100 

gtg agg aaa ate att ggg aag gaa etc gaa tta cat cct gaa act att 
Val Arg Lys He He Gly Lys Glu Leu Glu Leu His Pro Glu Thr He 
105 HO H5 

gtc tct att att tct caa gca ttg aaa gag etc aca caa aat aaa cat 
Val Ser He He Ser Gin Ala Leu Lys Glu Leu Thr Gin Asn Lys His 
120 125 130 

ate att ate tct gtc aat ccc aaa gat tta cct ctt gtt gag aaa agt 
He He He Ser Val Asn Pro Lys Asp Leu Pro Leu Val Glu Lys Ser 
135 140 145 

cgt cct gaa etc aag aac ate gtg gag tat get gac tec tta att ctt 
Arg Pro Glu Leu Lys Asn He Val Glu Tyr Ala Asp Ser Leu He Leu 
150 155 160 165 

aca gca aaa cct gat gtt act cct ggg ggt tgc att ate gag act gaa 
Thr Ala Lys Pro Asp Val Thr Pro Gly Gly Cys He He Glu Thr Glu 
170 175 180 

gca ggg ate ate aat gcg cag ctt gat gta caa tta gat gee tta gaa 
Ala Gly He He Asn Ala Gin Leu Asp Val Gin Leu Asp Ala Leu Glu 
185 190 195 



259 



307 



355 



403 



451 



499 



547 



595 



643 



691 
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Fig. 5 (con't) 

aaa get ttc teg act ata eta aaa gcg aag aac cct gta gac gag cca 

Lys Ala Phe Ser Thr lie Leu Lys Ala Lys Asn Pro Val Asp Glu Pro 
200 205 210 

tct gag act tea tea tec acg gat tct tct tct tta tct aat gat cag 

Ser Glu Thr Ser Ser Ser Thr Asp Ser Ser Ser Leu Ser Asn Asp Gin 
215 220 225 



739 



787 



gat aag aaa gaa taaaggtatt cactattatg cgatccattt ttcgattttc 839 

Asp Lys Lys Glu 

230 

cctttgtttt tttacgctga gcgtctcatg ctgatttget gacgccagtc tatatgaaaa 899 
n 900 
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Figure 6 (RY-35) 

Restriction analysis of CPN1 00422 

BseMII 
MboII | Ddel 

Alul AciI I I Hpyl8 8IX 

CviJI BsrBI | |MnlI | 

| I I I I I 

TAGCTTTATACAAAGTATAGAAAAATAACACGACAATAAAAGGAGCGGTGTTTTCTCTTC 

-L + + + + + + 60 

ATCGAAATATGTTTCATATCTTTTTATTGTGCTGTTATTTTCCTCGCCACAAAAGAGAAG 

Tsp509I 
Msel| 
Alul | | 

Earl CviJI Mnll CviJI | j 

III IN 

TGAGGTAAATCAGCCTCAAAGATACTACGCCATAGTAAAGATGAAGTTTTTTAGCTTAAT 

61 + + + + + + 120 

ACTCCATTTAGTCGGAGTTTCTATGATGCGGTATCATTTCTACTTCAAAAAATCGAATTA 

Alul 
CviJI 

Dra i Hindlll | SfaNI 

Msell Hin4I BsmAI Hpyl78III | j Mwol | 

ll II l I I l I 

TTTTAAAGATGATGATGTCTCCCCAAATAAGAAGGTTTTATCTCCTGAAGCTTTCTCTGC 

12 i + + + + + + 180 

AAAATTTCTACTACTACAGAGGGGTTTATTCTTCCAAAATAGAGGACTTCGAAAGAGACG 

CviRI 

Eco57I Alul BsmAI | 

Acellll CviJI CviJI CviJI Mwol | 

ll i i ill 

TTTCCTTGATGCCAAAGAGCTGTTAGAAAAAACAAAAGCCGATAGCGAAGCCTATGTTGC 

181 + + + + + + 240 

AAAGGAACTACGGTTTCTCGACAATCTTTTTTGTTTTCGGCTATCGCTTCGGATACAACG 



Apol 
Tsp509I 
BsiHKAI 
Bspl286I 
BseSI | 
CviRI | 
ApaLI | j 
I I I 



Dpnl 
Sau3AI 
Alul | 
CviJI j 
Hpyl78III | | 

I I 



Dral 
Msel | 
Mnll | | 
I li 



AGAGACAGAACAAAAGTGTGCACAAATTCGTCAAGAAGCTAAAGATCAAGGATTTAAAGA 

+ + + + + + 

TCTCTGTCTTGTTTTCACACGTGTTTAAGCAGTTCTTCGATTTCTAGTTCCTAAATTTCT 
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Fig. 6 (con't) 



ScrFI 
EcoRII 
Alwl 
Hinfl 
Tf il 
BsaBI 



Hpyl8 8IX 
Dpnl | 
BstYI | I 
Sau3AI I j 
I I 



Tthlllll 

Bpml| Mboll 
Ddelj Mboll | 

I! II 

GGGATCTGAATCCTGGAGCAAGCAAATTGCTTTCTTAGAAGAAGAAACTAAAAATCTACG 

301 + + + + + + 360 

CCCTAGACTTAGGACCTCGTTCGTTTAACGAAAGAATCTTCTTCTTTGATTTTTAGATGC 



Tsp509I 
Cac8I j 
I I 



BsaJI 
Styl 
CviJI 
Hael 
Haelll 
Thai | 
Rsal | | 
Mnll | | StuI 
I I I 



361 



CjePI 
Mnll 
Mnll | 
NlalV Muni | | 

Drdll| Tsp509I | j 

II III 
CATAAGAGTACGCGAGGCCTTGGTTCCTCTGGCAATTGCGAGTGTGAGGAAAATCATTGG 

+ + + + + + 

GTATTCTCATGCGCTCCGGAACCAAGGAGACCGTTAACGCTCACACTCCTTTTAGTAACC 



420 



Hpyl7 8III 
CjePI | 
Tsp509I | j 
Fokl TaqI | j j 
I III I 



Bce83I BsmAI 
I 



Smll 



GAAGGAACTCGAATTACATCCTGAAACTATTGTCTCTATTATTTCTCAAGCATTGAAAGA 



421 



CTTCCTTGAGCTTAATGTAGGACTTTGATAACAGAGATAATAAAGAGTTCGTAACTTTCT 



- + 480 



Banll 
BsiHKAI 
Bspl286I 
SacI 
Tthlllll 
Alul | 
CviJI 

l i I 

GCTCACACAAAATAAACATATCATTATCTCTGTCAATCCCAAAGATTTACCTCTTGTTGA 

481 + + + + + + 540 

CGAGTGTGTTTTATTTGTATAGTAATAGAGACAGTTAGGGTTTCTAAATGGAGAACAACT 



Mnll 
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6 (con't) 



Hpyl78III 
Bce83I Hpyl78III Smll | 



541 



Tsp509I 
Plel Hinfl Msel| 
I I M 

GAAAAGTCGTCCTGAACTCAAGAACATCGTGGAGTATGCTGACTCCTTAATTCTTACAGC 

+ + + + + + 600 

CTTTTCAGCAGGACTTGAGTTCTTGTAGCACCTCATACGACTGAGGAATTAAGAATGTCG 



ScrFI 

BsaJl| Hpyl78III Eco57I 

EcoRIlJI BsmAI | Dpnl Fspl 

Maelll | I | CviRI |TaqI | Sau3AI | Alwl | 

I III I I I I M II 

AAAACCTGATGTTACTCCTGGGGGTTGCATTATCGAGACTGAAGCAGGGATCATCAATGC 



601 



■+ 660 



TTTTGGACTACAATGAGGACCCCCAACGTAATAGCTCTGACTTCGTCCCTAGTAGTTACG 



Tsp509I 

Alul Bbvl 

CviJI Rsal 

Fnu4HI | BsrGI | 

Hhal| | SfaNI | 

Tsel| | TatI | 

II M 



Hpyl78III 
Alul | 
CviJI j 
Ddel Hindlll ] TaqI 
I II I 



GCAGCTTGATGTACAATTAGATGCCTTAGAAAAAGCTTTCTCGACTATACTAAAAGCGAA 



661 



CGTCGAACTACATGTTAATCTACGGAATCTTTTTCGAAAGAGCTGATATGATTTTCGCTT 



-+ 720 



Ddel 
Hpyl8 8IX 
Fokl 
Bed | 
BsmAI | | 
CviJI Ml 

HaeIV| III | Hinfl 
Hin4I j | | | | Tf il 

BseMII | j | | I I MboII 
MboII | I I I I I I BsaJI | 

ACCI II M I j I I BstDSI j 

Sfcl || j j j | j j | Hin4I MboII j 
I II I Mill 

GAACCCTGTAGACGAGCCATCTGAGACTTCATCATCCACGGATTCTTCTTCTTTATCTAA 



721 



CTTGGGACATCTGCTCGGTAGACTCTGAAGTAGTAGGTGCCTAAGAAGAAGAAATAGATT 



■+ 780 



Hpyl78III 
Dpnl | 
Bell | | 
Sau3AI j j 

I I I 



Dpnl 
Sau3AI | 
Alwl | j 

Msll| j j 

II I I 



TaqI 



TGATCAGGATAAGAAAGAATAAAGGTATTCACTATTATGCGATCCATTTTTCGATTTTCC 



781 



-+ 840 



ACTAGTCCTATTCTTTCTTATTTCCATAAGTGATAATACGCTAGGTAAAAAGCTAAAAGG 
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Fig. 6 (con't) 

Nlalll 

Bpull02I BsmAI | BsrI 

Hgal | BsmBI j PshAI 

BseMII | Ddel Mwol | j BsaHI | Hgal 

I! I Ml III 

CTTTGTTTTTTTACGCTGAGCGTCTCATGCTGATTTGCTGACGCCAGTCTATATGAAAAC 

841 + + + + + + 900 

GAAACAAAAAAATGCGACTCGCAGAGTACGACTAAACGACTGCGGTCAGATATACTTTTG 
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Figure 7: CPN 100424 

tgttcgcgat tggcactaat cccccctttt gttatggtga ataaaaaggt atgcgtggat 60 



tatggttcgt cgatctattt ctttttgctt gttctttcta atg aca ttg ctg tgc 

Met Thr Leu Leu Cys 

1 5 

tgt aca age tgt aac age agg tct eta att gtg cac ggt ctt cct ggc 

Cys Thr Ser Cys Asn Ser Arg Ser Leu lie Val His Gly Leu Pro Gly 

10 15 20 



gag caa atg tgg gat ate gcg gtt ccg tea gca caa ate aca gag gee 
Glu Gin Met Trp Asp lie Ala Val Pro Ser Ala Gin lie Thr Glu Ala 
55 60 65 



ctg tta gat ctt ttt gca aaa caa ggt ctt gtt cct tec gag ctt cag 
Leu Leu Asp Leu Phe Ala Lys Gin Gly Leu Val Pro Ser Glu Leu Gin 
90 95 100 

gaa aaa ate cgt tat caa gaa ggc tta tea gaa cag atg gec tct acg 
Glu Lys lie Arg Tyr Gin Glu Gly Leu Ser Glu Gin Met Ala Ser Thr 
105 110 115 

att aga aaa atg gat ggc gtt gtc gat gee tea gta cag att tec ttc 
lie Arg Lys Met Asp Gly Val Val Asp Ala Ser Val Gin lie Ser Phe 
120 125 130 

act aca gaa aat gaa gat aat ctt cct tta aca gee tct gtg tat att 
Thr Thr Glu Asn Glu Asp Asn Leu Pro Leu Thr Ala Ser Val Tyr lie 
135 140 145 

aag cat cga ggg gtt ttg gac aat ccg aac age att atg gtt tec aaa 
Lys His Arg Gly Val Leu Asp Asn Pro Asn Ser lie Met Val Ser Lys 
150 155 160 ' 165 

att aag cgc ctt att gca agt get gtt cca gga ctt gtg cca gag aac 
lie Lys Arg Leu lie Ala Ser Ala Val Pro Gly Leu Val Pro Glu Asn 
170 175 180 

gtc tct gta gtg age gat cgc gca get tat agt gat att aca att aat 
Val Ser Val Val Ser Asp Arg Ala Ala Tyr Ser Asp lie Thr He Asn 
185 190 195 



115 



163 



aga gaa gcg aat gag att gtg gtg ctt ttg gta age aaa ggg gtg get 211 

Arg Glu Ala Asn Glu He Val Val Leu Leu Val Ser Lys Gly Val Ala 
25 30 35 

gca caa aaa ttg cct caa get gca gcg get aca gee gga gca get act 259 

Ala Gin Lys Leu Pro Gin Ala Ala Ala Ala Thr Ala Gly Ala Ala Thr 
40 45 50 



307 



ctt gee att eta aat caa gcg ggt ctt cca cgt atg aaa ggg aca age 355 
Leu Ala He Leu Asn Gin Ala Gly Leu Pro Arg Met Lys Gly Thr Ser 
70 75 80 85 



403 



451 



499 



547 



595 



643 



691 
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Fig. 7 (con't) 

ggt cct tgg gga tta aca gaa gaa ate gat tat gtt tct gtt tgg ggt 739 

Gly Pro Trp Gly Leu Thr Glu Glu lie Asp Tyr Val Ser Val Trp Gly 

200 205 210 

att att ctt gcg aag tct teg etc acc aaa ttc cgt etc att ttt tat 787 

lie lie Leu Ala Lys Ser Ser Leu Thr Lys Phe Arg Leu lie Phe Tyr 

215 220 225 

gtc ttg att etc att tta ttt gtt att tct tgt ggt etc ctt tgg gtc 

Val Leu lie Leu lie Leu Phe Val lie Ser Cys Gly Leu Leu Trp Val 
230 235 240 245 

att tgg aaa act cat act etc att atg act atg gga ggt aca aaa ggg 

lie Trp Lys Thr His Thr Leu lie Met Thr Met Gly Gly Thr Lys Gly 

250 255 260 

ttc ttc aac cct aca cca tat aca aag aat gec ttg gaa gee aag aaa 931 

Phe Phe Asn Pro Thr Pro Tyr Thr Lys Asn Ala Leu Glu Ala Lys Lys 
265 270 275 



835 



883 



1027 



gec gag gga gca get get gac aaa gag aaa aaa gaa gat gca gat tea 979 
Ala Glu Gly Ala Ala Ala Asp Lys Glu Lys Lys Glu Asp Ala Asp Ser 
280 285 290 

cag ggg gaa age aaa aat gcg gaa acc agt gat aaa gac tct agt gat 
Gin Gly Glu Ser Lys Asn Ala Glu Thr Ser Asp Lys Asp Ser Ser Asp 
295 300 305 

aaa gat get cca gaa gga age aat gaa att gag ggt get tagtgactgc 1076 

Lys Asp Ala Pro Glu Gly Ser Asn Glu lie Glu Gly Ala 
310 315 320 

caacactttt ggaactctag acatcttgat gaagcactcc aaggaagatg acctctccag 1136 

gtttcttcct aaaaatcttc ttgttgaatc tcctcatccc gaagaaatcc ctttaaaatc 1196 

ttta 1200 
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Figure 8 (RY-36) 

Restriction analysis of CPN 100424 

Hpyl78III 
Nrul 

Thai HphI 

I I 
TGTTCGCGATTGGCACTAATCCCCCCTTTTGTTATGGTGAATAAAAAGGTATGCGTGGAT 

x + + + + + + 60 

ACAAGCGCTAACCGTGATTAGGGGGGAAAACAATACCACTTATTTTTCCATACGCACCTA 



61 



Tthlllll Mwol 

Dpnl | Rsal | 

Sau3AI | | BsrGI | | 

Drdll Taql| j j BsrDI MslI TatI j | 

I II I I I I I I I 

TATGGTTCGTCGATCTATTTCTTTTTGCTTGTTCTTTCTAATGACATTGCTGTGCTGTAC 

+ + + + + + 

ATACCAAGCAGCTAGATAAAGAAAAACGAACAAGAAAGATTACTGTAACGACACGACATG 



120 



ScrFI 
EcoRII 
Taal 
BsiHKAI 



Maelll 
Alul | 
BspMI j 
Cvi JI j Mwol 
I 



Bspl286I 
BseSI 
CviRI 
ApaLI 
Bbsl 
MboII 
Bsal | 
BsmAI | 
Tsp509I j 



AAGCTGTAACAGCAGGTCTCTAATTGTGCACGGTCTTCCTGGCAGAGAAGCGAATGAGAT 

121 + + + + + + 

TTCGACATTGTCGTCCAGAGATTAACACGTGCCAGAAGGACCGTCTCTTCGCTTACTCTA 



180 
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Fig. 8 (con't) 



Sfcl 
CviJI 
Fnu4HI 
Taul 
Acil 
MspAlI 
Mwol 

PStI 

Fnu4HI 
Mnll 
CviRI 



Bbvl 
Bsgl | 
I 



Tsp509I 
CviRI 
Bce83I 
Fnu4HI 
CviJI | 
Tselj | Bbvl 
I I 



Tsel 
Fnu4HI 
Sfcl 
Alul | 
CviJI j 
Smll Tselj 
I II 



TGTGGTGCTTTTGGTAAGCAAAGGGGTGGCTGCACAAAAATTGCCTCAAGCTGCAGCGGC 



181 



-+ 240 



ACACCACGAAAACCATTCGTTTCCCCACCGACGTGTTTTTAACGGAGTTCGACGTCGCCG 



241 



Ddel 
AlwNI 
RleAI 
Alul 
CviJI 
Fnu4HI 
CjePI | 
Mwol | 
Tsel | 
BseMII 
Mspl 
Bbvl | 
CviJI | 

Mwol || I I I I II Bbvl EcoRV 

TACAGCCGGAGCAGCTACTGAGCAAATGTGGGATATCGCGGTTCCGTCAGCACAAATCAC 

_^ . + + + 300 



CjePI 
NlalV 
Acil | 
Thai j 



Mnll 



ATGTCGGCCTCGTCGATGACTCGTTTACACCCTATAGCGCCAAGGCAGTCGTGTTTAGTG 



CviJI 
Haelll 
CjePI | 
EcoO109I | 
Sau96I | 



SimI 
Acil 
Bbsl 
MboII 
Faul | 
Sthl32l| j 



301 



BsaAI 
CjePI | 

Maell j CviJI 

II I 
AGAGGCCCTTGCCATTCTAAATCAAGCGGGTCTTCCACGTATGAAAGGGACAAGCCTGTT 

+ + + + + + 360 

TCTCCGGGAACGGTAAGATTTAGTTCGCCCAGAAGGTGCATACTTTCCCTGTTCGGACAA 
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Fig. 8 (con't) 

Dpnl 

Bglll | Hpyl78III 
BsmFI | Alul | 

BstYI | CviJI | 

Sau3AI I CviRI Eco57I Hpyl88IX | | Hpyl78III 

III I III I 

AGATCTTTTTGCAAAACAAGGTCTTGTTCCTTCCGAGCTTCAGGAAAAAATCCGTTATCA 

361 + + + + + + 420 

TCTAGAAAAACGTTTTGTTCCAGAACAAGGAAGGCTCGAAGTCCTTTTTTAGGCAATAGT 

Pf 111081 
CviJI 
Hael 

Hpyl88IX Haelll j Bed Fokl 

CviJI I Bed I | Mnll SfaNI TaqI | 

II I I I I I M 

AGAAGGCTTATCAGAACAGATGGCCTCTACGATTAGAAAAATGGATGGCGTTGTCGATGC 

421 + + + + + + 480 

TCTTCCGAATAGTCTTGTCTACCGGAGATGCTAATCTTTTTACCTACCGCAACAGCTACG 

Rsal 

TatI | BseMII Msel AlwNI 

Ddel | | Mnll | Sfcl MboII MboII | CviJI | 

I I I I I I I I I I I 

CTCAGTACAGATTTCCTTCACTACAGAAAATGAAGATAATCTTCCTTTAACAGCCTCTGT 

481 + + + + + + 540 

GAGTCATGTCTAAAGGAAGTGATGTCTTTTACTTCTATTAGAAGGAAATTGTCGGAGACA 

Mnll 
CjePI | 

Msel j M seI 
Mnll | j TaqI SfaNI Hpyl88IX CjePI Tsp509I | 

I I I I I II I' 

GTATATTAAGCATCGAGGGGTTTTGGACAATCCGAACAGCATTATGGTTTCCAAAATTAA 

541 + + + + + + 600 

CATATAATTCGTAGCTCCCCAAAACCTGTTAGGCTTGTCGTAATACCAAAGGTTTTAATT 

Bsp24I Cjel 

CjePI BsmAI CjePI 

Cjel | BsmBI Dpnl 

Haell Cjel ScrFI || Hin4I Cjel| Bsp24l| 

Hhall CviRI | EcoRII | Mae I I Sfclj Sau3AI | | 

II I I I I M IN Ml 

GCGCCTTATTGCAAGTGCTGTTCCAGGACTTGTGCCAGAGAACGTCTCTGTAGTGAGCGA 

601 + + + + + + 660 

CGCGGAATAACGTTCACGACAAGGTCCTGAACACGGTCTCTTGCAGAGACATCACTCGCT 
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Fig. 8 (con't) 



Alul 
CviJI 
Fnu4HI | 

Hhal | I BsaJI 

Tselj I Styl 

Thai | j | Avail 

BplI | | | I Sau96I 

BsiEI | M | Msel | | Tthlllll 

Pvul | | I I Bbvl Vspl || Cjel Clal | 

Sgfl | || | Cjel |Tsp509I | j j Msel | TaqI j 

I I III I I I I I I II II 

TCGCGCAGCTTATAGTGATATTACAATTAATGGTCCTTGGGGATTAACAGAAGAAATCGA 

66 1 + + + + + + 720 

AGCGCGTCGAATATCACTATAATGTTAATTACCAGGAACCCCTAATTGTCTTCTTTAGCT 



Mboll 



Bbsl 
Mboll 



Bsp24I 
CjePI 
Cjel | 
HphI | | 

r 1 1 



Apol 
Tsp509I 



BsmAI 
BsmBI 
I 



TTATGTTTCTGTTTGGGGTATTATTCTTGCGAAGTCTTCGCTCACCAAATTCCGTCTCAT 

721 + + + + + + 780 

AATACAAAGACAAACC C CATAAT AAG AACG CTT C AGAAGCG AGTGGTTT AAGGCAG AGTA 



Hinf I 
Tf il 
Hpyl78III 
Cjel| 
CjePI | 
Bsp24I | j 
II 



Bsal 
BsmAI SimI 

I I 



TTTTTATGTCTTGATTCTCATTTTATTTGTTATTTCTTGTGGTCTCCTTTGGGTCATTTG 

781 + + + + + + 840 

AAAAATACAGAACTAAGAGTAAAATAAACAATAAAGAACACCAGAGGAAACCCAGTAAAC 



Mboll 

Mnll Rsal | 

I I I 

GAAAACTCATACTCTCATTATGACTATGGGAGGTACAAAAGGGTTCTTCAACCCTACACC 

841 + _._, + + - + + + 900 

CTTTTGAGTATGAGAGTAATACTGATACCCTCCATGTTTTCCCAAGAAGTTGGGATGTGG 
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Fig. 8 (con't) 



BsaJI 
CviJI 

BsmI Bbvl | 

BsaJI | CviJI Mwol | j 
Styl | Mwol | Mnll | | | 

Mil II 



AlwNI 
Fnu4HI 

Alul 
CviJI 
MspAlI 
PvuII 
Tsel 
Fnu4HI | 
Tselj | 
Mwol | j | 
I II I 



Sf aNI 



Bbvl 



ATATACAAAGAATGCCTTGGAAGCCAAGAAAGCCGAGGGAGCAGCTGCTGACAAAGAGAA 



901 



TATATGTTTCTTACGGAACCTTCGGTTCTTTCGGCTCCCTCGTCGACGACTGTTTCTCTT 



■+ 960 



MboII 
Hinfl | 
Tf il | 
CviRI | | 
I 



Acil 



Hinfl 
Hin4I | 
TspRI | |BfaI 
BsrI Plel | j |BpmI 
I I I II I 



AAAAGAAGATGCAGATTCACAGGGGGAAAGCAAAAATGCGGAAACCAGTGATAAAGACTC 



961 



- + 1020 



TTTTCTTCTACGTCTAAGTGTCCCCCTTTCGTTTTTACGCCTTTGGTCACTATTTCTGAG 



BsrDI Maelll 

Tsp509l| Tsp45I 

SfaNI Hpyl7 8III Mnll | | Ddel | Bsbl 

| I III II I 

TAGTGATAAAGATGCTCCAGAAGGAAGCAATGAAATTGAGGGTGCTTAGTGACTGCCAAC 



1021 



-+ 1080 



ATCACTATTTCTACGAGGTCTTCCTTCGTTACTTTAACTCCCACGAATCACTGACGGTTG 

ScrFI 

Hpyl78III EcoRII 
Hpyl7 8III | Bpml HgiEII 

Bfal| | BsaJI | Hin4I | 

Xbal|| MslI Stylj CjePI |MboII |MnlI 

III I M III 

ACTTTTGGAACTCTAGACATCTTGATGAAGCACTCCAAGGAAGATGACCTCTCCAGGTTT 



1081 



-+ 1140 



TGAAAACCTTGAGATCTGTAGAACTACTTCGTGAGGTTCCTTCTACTGGAGAGGTCCAAA 



Sthl32I 
Mnll | 
Hpyl78III | j 



Dral 
MboII | 
Msel 



Hinfl 
CjePI | 
BseRI | | 
MboII FokljTfil 

i M I I i i 1 1 

CTTCCTAAAAATCTTCTTGTTGAATCTCCTCATCCCGAAGAAATCCCTTTAAAATCTTTA 

1141 + + + + + + 1200 

GAAGGATTTTTAGAAGAACAACTTAGAGGAGTAGGGCTTCTTTAGGGAAATTTTAGAAAT 
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Figure 9: CPN100426 

ttgaacccta tggaaatgta tcttatttgt gctgggctat atttcttaat gacaacatca 60 

ttttcctgta tttctaggtt atcagaaaag agaaggagtt atg aca att aga gtc 115 

Met Thr lie Arg Val 
1 5 



cga aac ctt gcc tac tct gta aat aag aaa aag att eta gat ggt gta 
Arg Asn Leu Ala Tyr Ser Val Asn Lys Lys Lys lie Leu Asp Gly Val 
10 15 20 



aag aat tat cct gac cag etc tct ggg gga caa aaa caa cgt gtg get 

Lys Asn Tyr Pro Asp Gin Leu Ser Gly Gly Gin Lys Gin Arg Val Ala 
120 125 130 

att gta cgt tct tta tgt atg gat aaa cat aca tta ctt ttt gat gaa 

lie Val Arg Ser Leu Cys Met Asp Lys His Thr Leu Leu Phe Asp Glu 

135 140 145 

cct aca teg get tta gat cct ttt get acg gca teg ttc cga cat ctt 

Pro Thr Ser Ala Leu Asp Pro Phe Ala Thr Ala Ser Phe Arg His Leu 

150 155 160 165 

tta gaa aca ctt cga gac cag gaa ctg act gta ggg tta act act cat 

Leu Glu Thr Leu Arg Asp Gin Glu Leu Thr Val Gly Leu Thr Thr His 
170 175 180 

gac atg caa ttt gtt cat agt tgt ttg gat cgt ate tat ctt ata gat 

Asp Met Gin Phe Val His Ser Cys Leu Asp Arg lie Tyr Leu lie Asp 
185 190 195 



163 



act ttt tct tta gag cga ggg cac att aca ctg ttt gtt ggg aag agt 211 
Thr Phe Ser Leu Glu Arg Gly His lie Thr Leu Phe Val Gly Lys Ser 
25 30 35 

ggt tea gga aaa aca atg att tta cgt get ttg gcg ggc tta gtc cag 259 
Gly Ser Gly Lys Thr Met He Leu Arg Ala Leu Ala Gly Leu Val Gin 
40 45 50 

ccc act caa gga gat att tgg att gaa ggg gag get cca get eta gtt 
Pro Thr Gin Gly Asp He Trp He Glu Gly Glu Ala Pro Ala Leu Val 
55 60 65 

ttc caa caa ccc gag tta ttt tec cat atg aca gta tta gga aat tgc 
Phe Gin Gin Pro Glu Leu Phe Ser His Met Thr Val Leu Gly Asn Cys 
70 75 80 85 

acc cat cca caa ate cat ate aag ggt cgt agt acc gaa gaa get cga 403 
Thr His Pro Gin He His He Lys Gly Arg Ser Thr Glu Glu Ala Arg 
90 95 100 

gaa aag gcg ttc gag ctt tta cat ttg ttg gat att gaa gag gtt get 451 
Glu Lys Ala Phe Glu Leu Leu His Leu Leu Asp He Glu Glu Val Ala 
105 HO 115 



307 



355 



499 



547 



595 



643 



691 
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Fig. 9 (con't) 

caa gga act gtt gcg ggg gtc tat gac aag cgt gac gga gag etc gat 73 9 

Gin Gly Thr Val Ala Gly Val Tyr Asp Lys Arg Asp Gly Glu Leu Asp 
200 205 210 

tct ggt cat cca tta teg aaa tat ate cac tct get caa taggactaca 788 

Ser Gly His Pro Leu Ser Lys Tyr lie His Ser Ala Gin 

215 220 225 

getgetagag cagctgtagt gatactttag aatcctgacc agtggcagga atgageggea 848 
tg 850 
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Figure 10 (RY-37) 

Restriction enzyme analysis of CPN 100426 

CviJI Msel 
I I 

TTGAACCCTATGGAAATGTATCTTATTTGTGCTGGGCTATATTTCTTAATGACAACATCA 

x + + + + + + 60 

AACTTGGGATACCTTTACATAGAATAAACACGACCCGATATAAAGAATTACTGTTGTAGT 



Hpyl8 8IX 

Bfal Hpyl88IX Tsp509I Hinfl jPlel 

ll I lit 

TTTTCCTGTATTTCTAGGTTATCAGAAAAGAGAAGGAGTTATGACAATTAGAGTCCGAAA 

61 + + + + + + 120 

AAAAGGACATAAAGATCCAATAGTCTTTTCTCTTCCTCAATACTGTTAATCTCAGGCTTT 

Hpyl78III 
Bfal | 
Xbal | | 

Hinfl | I I Maelll Mnll 
CjePI Tfil lllBccI | CjePl| 

I I I II II M 

CCTTGCCTACTCTGTAAATAAGAAAAAGATTCTAGATGGTGTAACTTTTTCTTTAGAGCG 

121 + + + + + + 180 

GGAACGGATGAGACATTTATTCTTTTTCTAAGATCTACCACATTGAAAAAGAAATCTCGC 

Bspl286I Hpyl78III Faul 

Tthllllll Earl MboII Sthl32l| 

Bmgl | | TspRI | Drdll | BsaAI | | 

BseSlll Taal 1 | Alol | | Cjel Maell | | | 

| | | III III I INI 

AGGGCACATTACACTGTTTGTTGGGAAGAGTGGTTCAGGAAAAACAATGATTTTACGTGC 

181 + + - — + + + + 240 

TCCCGTGTAATGTGACAAACAACCCTTCTCACCAAGTCCTTTTTGTTACTAAAATGCACG 

Ddel 
Bce83I | 
CviJI | Cjel 
CacSI | | CviJI BslI Mnll NlalV Alul 

Acil | || BspGI | Smll | Bpml |CjeI CviJI | CviJI 

I I II II II III III 

TTTGGCGGGCTTAGTCCAGCCCACTCAAGGAGATATTTGGATTGAAGGGGAGGCTCCAGC 

241 + + + + + + 300 

AAACCGCCCGAATCAGGTCGGGTGAGTTCCTCTATAAACCT.AACTTCCCCTCCGAGGTCG 

Sthl32I 

Cjel | Mmel Tsp509I 

Bfal Acelll Aval | | Ndel | Taal Fokl | CviRI Bed 

| I III II I II 1 I 

TCTAGTTTTCCAACAACCCGAGTTATTTTCCCATATGACAGTATTAGGAAATTGCACCCA 

3d + + + + + + 360 

AGATCAAAAGGTTGTTGGGCTCAATAAAAGGGTATACTGTCATAATCCTTTAACGTGGGT 
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Fig. 10 (con't) 



Pf 111081 
SimI | 
Taqll BslI | |RsaI 
I I 



Mboll 
Hpyl78III 
TaqI 
Aval 
Smll 
Xhol 
Alul | 
CviJI j 



Alul 
CviJI 
Cjel | 
TaqI | 
Mmel | | 
I I I 



TCCACAAATCCATATCAAGGGTCGTAGTACCGAAGAAGCTCGAGAAAAGGCGTTCGAGCT 

361 + + + + + + 420 

AGGTGTTTAGGTATAGTTCCCAGCATCATGGCTTCTTCGAGCTCTTTTCCGCAAGCTCGA 



Hpyl78III 
Tsp509I 
Mboll I 

Mnll Ddel | j 

Earl | Cjel | j j 

II I I I I 

TTTACATTTGTTGGATATTGAAGAGGTTGCTAAGAATTATCCTGACCAGCTCTCTGGGGG 

421 + + + + + + 480 

AAATGTAAACAACCTATAACTTCTCCAACGATTCTTAATAGGACTGGTCGAGAGACCCCC 



BstXI 
Alul | 
CviJI |AceIII 
Hin4l| I Cjel | 
II I II 



BsmFI 

AflHI | Maell 
Mae I I | CviJI Rsal | Cjel 

III II I 

ACAAAAACAACGTGTGGCTATTGTACGTTCTTTATGTATGGATAAACATACATTACTTTT 

481 + + + + + + 540 

TGTTTTTGTTGCACACCGATAACATGCAAGAAATACATACCTATTTGTATGTAATGAAAA 



Dpnl 
BstYI | 

Sau3AI j Bcefl 
Alwl | | SfaNI | 

CviJI | | Hpyl88IX| j 

III M I 

TGATGAACCTACATCGGCTTTAGATCCTTTTGCTACGGCATCGTTCCGACATCTTTTAGA 

541 + + + + + + 600 

ACTACTTGGATGTAGCCGAAATCTAGGAAAACGATGCCGTAGCAAGGCTGTAGAAAATCT 



ScrFI 
EcoRII 
Mmel 
Hpyl78III 
TaqI 
Bsal | 
BsmAI | 
Xmnl j 



Tthlllli 
Tsp509I 
CviRI 



Nlalll 
Nlalll | 



Hpyl78III 



Hindi | 
Taal Hpal j 
AlwNI Sfcl| Msel| Real j 
III II II 



Nspl 



AACACTTCGAGACCAGGAACTGACTGTAGGGTTAACTACTCATGACATGCAATTTGTTCA 



601 



-+ 660 



TTGTGAAGCTCTGGTCCTTGACTGACATCCCAATTGATGAGTACTGTACGTTAAACAAGT 
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Fig. 10(con't) 



Taal 
Faul 
Sthl32I | 
Dpnl | | 

Sau3AI | || |AciI SimI 

II I II II 

TAGTTGTTTGGATCGTATCTATCTTATAGATCAAGGAACTGTTGCGGGGGTCTATGACAA 

661 + + + + + + 720 

ATCAACAAACCTAGCATAGATAGAATATCTAGTTCCTTGACAACGCCCCCAGATACTGTT 



Dpnl 
Sau3AI I Alwl 



Maelll 
Tsp45I 
I 



Hinf I 
Tf il 
Banll 
BsiHKAI 
Bspl286I 
Sad 
TaqI 
Alul | 
CviJI | 
Fokl | | 
I 



TaqI 
I 



Bbvl 



GCGTGACGGAGAGCTCGATTCTGGTCATCCATTATCGAAATATATCCACTCTGCTCAATA 

721 + + + + + + 780 

CGCACTGCCTCTCGAGCTAAGACCAGTAGGTAATAGCTTTATATAGGTGAGACGAGTTAT 



Sfcl 
Alul 
CviJI 
MspAlI 
PvuII 
Fnu4HI 
Mwol 
Tsel 
Mwol 
Bf al 
Fnu4HI 
Alul | 
CviJI j 
MspAlI j 
PvuII | 

Sfcl Tsel j i i j j i i Bbvl 

I 1 1 

GGACTACAGCTGCTAGAGCAGCTGTAGTGATACTTTAGAATCCTGACCAGTGGCAGGAAT 

781 + + + + + + 840 

CCTGATGTCGACGATCTCGTCGACATCACTATGAAATCTTAGGACTGGTCACCGTCCTTA 



Hpyl78III 
Hinfl | 
Tfil BsrI 



TspRI 
BslI I 
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Fnu4HI 
Taul 
Acil | 
BsrBI | Nlalll 

II I 
GAGCGGCATG 

841 + 850 

CTCGCCGTAC 
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Figure 11: CPN190508 

ctctgattta tggtaattct ttattttcag agccgtcaag tcctttctat tctgttgaat 60 

ttcctaataa cgtaagtaat aaacaatcaa aagtccgcat atg aaa aga cct ttt 115 

Met Lys Arg Pro Phe 
1 5 

ttt acc tat eta tgc ate ate ttc tac gga tct tgt gca teg tta tct 163 
Phe Thr Tyr Leu Cys lie lie Phe Tyr Gly Ser Cys Ala Ser Leu Ser 
10 15 20 

tta cat gca gga etc tct ttc cca gaa gta cgt gga get acg get get 211 
Leu His Ala Gly Leu Ser Phe Pro Glu Val Arg Gly Ala Thr Ala Ala 
25 30 35 



gtt gtc cat gee gac tct ggg aag gta ttc tat gat aaa gac ata gat 25 9 

Val Val His Ala Asp Ser Gly Lys Val Phe Tyr Asp Lys Asp lie Asp 

Val Val His Ala Asp Ser Gly Lys Val Phe Tyr Asp Lys Asp lie Asp 
40 45 50 

get gta ate tat cct gee age atg acg aaa ate gca act gec etc ttt 307 

Ala Val lie Tyr Pro Ala Ser Met Thr Lys lie Ala Thr Ala Leu Phe 

Ala Val He Tyr Pro Ala Ser Met Thr Lys He Ala Thr Ala Leu Phe 
55 60 65 

ate eta aag cac tat ccc aca gtc etc gat act etc ate aaa gtc aaa 355 

He Leu Lys His Tyr Pro Thr Val Leu Asp Thr Leu He Lys Val Lvs 

He Leu Lys His Tyr Pro Thr Val Leu Asp Thr Leu He Lys Val Lys 

70 75 * 80 85 

caa gat gcg ate get tec ate act ccg caa gca aaa aaa caa tea gga 403 

Gin Asp Ala He Ala Ser lie Thr Pro Gin Ala Lys Lys Gin Ser Gly 

Gin Asp Ala He Ala Ser He Thr Pro Gin Ala Lys Lys Gin Ser Gly 

90 95 100 

tat cgt agt cct ccc cac tgg tta gaa act gat gga tct aca ata cag 451 

Tyr Arg Ser Pro Pro His Trp Leu Glu Thr Asp Gly Ser Thr He Gin 

Tyr Arg Ser Pro Pro His Trp Leu Glu Thr Asp Gly Ser Thr He Gin 

105 110 115 

etc cat ctt cga gaa gag ctt tta ggg tgg gac ctg ttc cac gee tta 4 99 

Leu His Leu Arg Glu Glu Leu Leu Gly Trp Asp Leu Phe His Ala Leu 

Leu His Leu Arg Glu Glu Leu Leu Gly Trp Asp Leu Phe His Ala Leu 

120 125 130 

ctg gtc tgt tct get aat gat get gcg aat gtc tta get atg gca tgt 547 

Leu Val Cys Ser Ala Asn Asp Ala Ala Asn Val Leu Ala Met Ala Cys 

Leu Val Cys Ser Ala Asn Asp Ala Ala Asn Val Leu Ala Met Ala Cys 
135 140 145 

tgc gga tct gta gag aag ttt atg gat aag ctg aac ttc ttc tta aaa 595 

Cys Gly Ser Val Glu Lys Phe Met Asp Lys Leu Asn Phe Phe Leu Lys 

Cys Gly Ser Val Glu Lys Phe Met Asp Lys Leu Asn Phe Phe Leu Lys 

150 155 160 165 

gaa gaa ate ggc tgc act cat acc cat ttt aat aat ccc cat ggg tta 64 3 

Glu Glu He Gly Cys Thr His Thr His Phe Asn Asn Pro His Gly Leu 

Glu Glu He Gly Cys Thr His Thr His Phe Asn Asn Pro His Gly Leu 

170 175 130 
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cat cat ccg aat cac tat act aca acc cgt gat ctt att age a:: atg 691 

His His Pro Asn His Tyr Thr Thr Thr Arg Asp Leu lie Ser lie Met 

His His Pro Asn His Tyr Thr Thr Thr Arg Asp Leu lie Ser lie Met 

185 190 195 

cgt tgc get ctg aaa gaa cct cca ttt cga ggg gtc ate tec acg aca 739 

Arg Cys Ala Leu Lys Glu Pro Pro Phe Arg Gly Val lie Ser Thr Thr 

Arg Cys Ala Leu Lys Glu Pro Pro Phe Arg Gly Val lie Ser Thr Thr 

200 205 210 

age tat aaa ata ggg get aca aac ctg cat ggc gaa egg ate eta tec 787 

Ser Tyr Lys lie Gly Ala Thr Asn Leu His Gly Glu Arg lie Leu Ser 

Ser Tyr Lys lie Gly Ala Thr Asn Leu His Gly Glu Arg lie Leu Ser 

215 220 225 

cca aca aac aaa ttg ctt ctt cct ggg tct acc tac cac tat ccc cca 835 

Pro Thr Asn Lys Leu Leu Leu Pro Gly Ser Thr Tyr His Tyr Pro Pro 

Pro Thr Asn Lys Leu Leu Leu Pro Gly Ser Thr Tyr His Tyr Pro Pro 

230 235 240 245 

get tta gga ggg aaa aca ggg acc acc aag act gca ggg aaa aat eta 88 3 

Ala Leu Gly Gly Lys Thr Gly Thr Thr Lys Thr Ala Gly Lys Asn Leu 

Ala Leu Gly Glv Lys Thr Gly Thr Thr Lys Thr Ala Giv Lys Asn Leu 

250 253 " 260 

att atg get get gaa aaa aat aac cgc etc ttg gta acg ate gca acg 931 

lie Met Ala Ala Glu Lys Asn Asn Arg Leu Leu Val Thr lie Ala Thr 

lie Met Ala Ala Glu Lys Asn Asn Arg Leu Leu Val Thr lie Ala Thr 
265 270 275 

ggc tat teg ggt cct gtg agt gat etc tac caa gat gtc att get eta 979 

Gly Tyr Ser Gly Pro Val Ser Asp Leu Tyr Gin Asp Val lie Ala Leu 

Gly Tvr Ser Glv Pro 

280 "* 285 290 

tgt gaa acg gta ttt aac gag ccg eta tta aga aaa gag etc gtc ccc 1027 
Cys Glu Thr Val Phe Asn Glu Pro Leu Leu Arg Lys Glu Leu Val Pro 
295 300 305 

ccc tec gac tgt etc caa tta gaa ata gcg aat ctt ggg aag ctt tct 1075 
Pro Ser Asp Cys Leu Gin Leu Glu lie Ala Asn Leu Gly Lys Leu Ser 
310 315 320 325 

tgc cct ctt cct gag gga etc tac tat gac ttc tat gee tec gaa gat 1123 
Cys Pro Leu Pro Glu Gly Leu Tyr Tyr Asp Phe Tyr Ala Ser Glu Asp 
330 335 340 

cgc gaa cct ctt tct gta tct ttt att gca cat gcg gac gee ttc cct 1171 
Arg Glu Pro Leu Ser Val Ser Phe lie Ala His Ala Aso Ala Phe Pro 
345 350 * 355 

att gaa caa gga gat ctt ctz ggz cat tgg gtt ttt tat gac gat gaa 1219 
lie Glu Gin Gly Asp Leu Leu Gly His Trp Val Phe Tyr Aso Asd Glu 
360 " 365 " 370 
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Fig. 1 1 (con't) 

ggc aag aaa att tct tec cag ccc ttc tat gec cct tgt cgt ttt gag 1267 
Gly Lys Lys lie Ser Ser Gin Pro Phe Tyr Ala Pro Cys Arg Phe Glu 
375 380 385 

cgc act ate aag cct tgg aaa etc tat atg aaa cgt gtc ttc aca teg 1315 

Arg Thr lie Lys Pro Trp Lys Leu Tyr Met Lys Arg Val Phe Thr Ser 
390 395 400 405 

tat aga acc tat atg tct ata acc atg ctg etc atg tat ttt cgc ate 1363 

Tyr Arg Thr Tyr Met Ser He Thr Met Leu Leu Met Tyr Phe Arg He 
410 415 420 

cgc aag cac cgc aag tat aaa aat tea aaa cac tat tct aaa ate 1408 

Arg Lys His Arg Lys Tyr Lys Asn Leu Lys His Tyr Ser Lys He 
425 430 435 

taactttttc ttttaattta taaaaaacca aaggtttatg taagatttgc gcttttcaat 1468 

ccaacaagaa tcccttgtgc gcacattact tt 1500 
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Figure 12 (RY-39) 

Restriction enzyme analysis of CPN 100508 

Bcefl CviJI Apol 

Hpyl88IX Tsp509I | Hpyl88IX | Tsp509I 

| I I I I I 

CTCTGATTTATGGTAATTCTTTATTTTCAGAGCCGTCAAGTCCTTTCTATTCTGTTGAAT 

x + + + + + + 60 

GAGACTAAATACCATTAAGAAATAAAAGTCTCGGCAGTTCAGGAAAGATAAGACAACTTA 

Ndel 

Mae I I Acil | 

I I I 

TTCCTAATAACGTAAGTAATAAACAATCAAAAGTCCGCATATGAAAAGACCTTTTTTTAC 

61 + + + + + + 120 

AAGGATTATTGCATTCATTATTTGTTAGTTTTCAGGCGTATACTTTTCTGGAAAAAAATG 

Hinf I 

Dpnl CviRI 
Nsil BstYI | Nlalll 

CviRI | Sau3AI I CviRI Nspl 

MboII | | SfaNI | | Alwl| SfaNI Plel | 

III III II I l> 

CTATCTATGCATCATCTTCTACGGATCTTGTGCATCGTTATCTTTACATGCAGGACTCTC 

12 i + + + + + + 180 

GATAGATACGTAGTAGAAGATGCCTAGAACACGTAGCAATAGAAATGTACGTCCTGAGAG 

CjePI Hinf I 



BsaAI Fnu4HI 
Bbvl | CviJI | 

Maellj Alul Mwol | 
Rsal | | CviJI Tsel j 
III I II 



Xcml 
Nlalll | 
Bcefl | | 

Plel | | |BslI XmnI 
II I I I 



TTTCCCAGAAGTACGTGGAGCTACGGCTGCTGTTGTCCATGCCGACTCTGGGAAGGTATT 

+ + + + + + 

AAAGGGTCTTCATGCACCTCGATGCCGACGACAACAGGTACGGCTGAGACCCTTCCATAA 



SfaNI Nlalll 
CjePI | CacSI | 



CTATGATAAAGACATAGATGCTGTAATCTATCCTGCCAGCATGACGAAAATCGCAACTGC 

H + + + + + 

GATACTATTTCTGTATCTACGACATTAGATAGGACGGTCGTACTGCTTTTAGCGTTGACG 



Mnll 

Mnll Taal TaqI RleAI | SfaNI 

I i i ii i 

CCTCTTTATCCTAAAGCACTATCCCACAGTCCTCGATACTCTCATCAAAGTCAAACAAGA 

301 + + + + + " + 360 

GGAGAAATAGGATTTCGTGATAGGGTGTCAGGAGCTATGAGAGTAGTTTCAGTTTGTTCT 
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Fig. 12 (con't) 



Tthlllll 
BsiEI 
Pvul 
Sgf I 
Dpnl | 
Sau3AI I 



Cac8I 
Acil | 



Pf 111081 
EcoRV | 
Hpyl78III | | 
Tthlllll I 



Bed 
I 

TGCGATCGCTTCCATCACTCCGCAAGCAAAAAAACAATCAGGATATCGTAGTCCTCCCCA 

361 + + + + + + 420 

ACGCTAGCGAAGGTAGTGAGGCGTTCGTTTTTTTGTTAGTCCTATAGCATCAGGAGGGGT 



Hpyl78III 
Acelll 

Dpnl Alul TaqI 

Bsri BstYI | CviJI Earl | 

TspRI Sau3AI | MboII | Sapl j 

Mnll | Bed | | Alwl | |BCCI | j 

II III I I I I II 



Alul 
CviJI 
I 



MboII 



CTGGTTAGAAACTGATGGATCTACAATACAGCTCCATCTTCGAGAAGAGCTTTTAGGGTG 

421 + + + + + + 480 

GACCAATCTTTGACTACCTAGATGTTATGTCGAGGTAGAAGCTCTTCTCGAAAATCCCAC 



NlalV 
Avail | 
ECO0109I j 
PspSII j 
Sau$>61 | 
I I 



BsmFI 



Bbvl 
SfaNI 
Bsri | 



I 



Fnu4HI 
Tsel | 
Mwol I I 

I 1 1 



Alul 
CviJI 
Ddel | 
I I 



GGACCTGTTCCACGCCTTACTGGTCTGTTCTGCTAATGATGCTGCGAATGTCTTAGCTAT 

481 + + + + + + 540 

CCTGGACAAGGTGCGGAATGACCAGACAAGACGATTACTACGACGCTTACAGAATCGATA 



Sfcl 
Dpnl 
BstYI 
Sau3AI 
Acil | 
Nlalll | j 
Nspl j j 
I 



Alul 
CviJI 
MboII 



Bbvl 
Bsgl | 
Msel | j 



| Alwl MboII XmnI 

I I I Ml 

GGCATGTTGCGGATCTGTAGAGAAGTTTATGGATAAGCTGAACTTCTTCTTAAAAGAAGA 

541 + + + --- + + + 600 

CCGTACAACGCCTAGACATCTCTTCAAATACCTATTCGACTTGAAGAAGAATTTTCTTCT 
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Fig. 12 (con't) 



CviRI 
MboII 
Fnu4HI | 
Cvi JI | j 
Tsel j | 
Ml 



Msel 



Maelll 
Nlalll 
BsaJI 
BstDSI 
NCOI 

Styl 
Fokl | 

I I 



Hinf I 
Tf il 
Hpyl88IX | 



601 



AATCGGCTGCACTCATACCCATTTTAATAATCCCCATGGGTTACATCATCCGAATCACTA 

+ + + + + + 

TTAGCCGACGTGAGTATGGGTAAAATTATTAGGGGTACCCAATGTAGTAGGCTTAGTGAT 

Hpyl8 8IX 



660 



Sthl32I 
Dpnl | 
Sau3AI | | 
BscGI | j j 
I I II 



Sf aNI 
Nlalll | Hhal 



Mnll 
BslI | 
Mnll TaqI | j 
I III 



661 



TACTACAACCCGTGATCTTATTAGCATCATGCGTTGCGCTCTGAAAGAACCTCCATTTCG 

, h | | H h 

ATGATGTTGGGCACTAGAATAATCGTAGTACGCAACGCGAGACTTTCTTGGAGGTAAAGC 



720 



Dpnl 
NlalV 
BamHI 
BstYI 
Sau3AI 
Alwl 
BspMl | 

Hin4I Alul Nlalll | | 

SimI CviJI CviJI CviRI | | | 



CviJI 
I 

AGGGGTCATCTCCACGACAAGCTATAAAATAGGGGCTACAAACCTGCATGGCGAACGGAT 



721 



■+ 780 



TCCCCAGTAGAGGTGCTGTTCGATATTTTATCCCCGATGTTTGGACGTACCGCTTGCCTA 



Alwl 



MboII 
Tsp509I 
I 



AccI 
SimI 
ScrFI 
BsaJI | 
EcoRII | | 
Tthlllll j | 
I I I 



BslI 
Alul | 
CviJI j 
Mnll 



CCTATCCCCAACAAACAAATTGCTTCTTCCTGGGTCTACCTACCACTATCCCCCAGCTTT 



781 



-+ 840 



GGATAGGGGTTGTTTGTTTAACGAAGAAGGACCCAGATGGATGGTGATAGGGGGTCGAAA 



PstI 

NlalV CviRI | Fnu4HI 

Avail] BsmFl| | Tsp509I CviJI | 

Sau96l| Sfcl|| | Bbvl | Tsel j 

II 111 I II M 

AGGAGGGAAAACAGGGACCACCAAGACTGCAGGGAAAAATCTAATTATGGCTGCTGAAAA 

841 + + + + + + 900 

TCCTCCCTTTTGTCCCTGGTGGTTCTGACGTCCCTTTTTAGATTAATACCGACGACTTTT 
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Fig. 12 (con't) 



CviJI 
BscGI 
Sthl32I 
BsiEI 
Pvul 
Dpnl 



Sau3AI 
Sthl32I 

Maelll III || | Sau96I | Dpnl 

Acil BslI |Mnll| II III SimI | Sau3AI 

I I I II 

AAATAACCGCCTCTTGGTAACGATCGCAACGGGCTATTCGGGTCCTGTGAGTGATCTCTA 

901 + + + + + + 960 

TTTATTGGCGGAGAACCATTGCTAGCGTTGCCCGATAAGCCCAGGACACTCACTAGAGAT 



NlalV 
Avail 
ECOO109I 
PspSII 
Sau96I 
SimI 



Banll 
BsiHKAI 

Acil Bspl286I 
' Fnu4HI Sad 
Taul BsmFI Alul | 

BsrDI Taal Msel Cvi JI | Msel CviJI j 

i i i ii I ii 

CCAAGATGTCATTGCTCTATGTGAAACGGTATTTAACGAGCCGCTATTAAGAAAAGAGCT 

961 + + + + + + 

GGTTCTACAGTAACGAGATACACTTTGCCATAAATTGCTCGGCGATAATTCTTTTCTCGA 



1020 



BsmAI 
Tsp509I 
Mnll | 
Hpyl88IX Taal | j 
I I ! 



BseMll 
MboII 

Hinfl Alul | 

Tfil CviJI | 

Mmel| Hindlll | | 
II Ml 



CGTCCCCCCCTCCGACTGTCTCCAATTAGAAATAGCGAATCTTGGGAAGCTTTCTTGCCC 

1021 + + + + + + 1080 

GCAGGGGGGGAGGCTGACAGAGGTTAATCTTTATCGCTTAGAACCCTTCGAAAGAACGGG 



Hinfl 
Mnll 
Bsu36I 
Ddel 
Earl 
Hpyl78III 
Mnll Plel 
I 



BsmFI 



Hpyl78III 
Nrul 
Thai 
Mnll 
Dpnl | 
Sau3AI | | 
Hpyl8 8IX | j | |MboII 
I I Ml I 



Mnll 



TCTTCCTGAGGGACTCTACTATGACTTCTATGCCTCCGAAGATCGCGAACCTCTTTCTGT 



1081 



-+ 1140 



AG AAGG ACTC CCTG AG ATG AT ACTG AAG AT ACGG AGG C TT C TAG CG CTTGG AG AAAG AC A 
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Fig. 12 (con't) 

Dpnl 

BsaHI Bglll 
Acil | BstYI 
Nlalll | Sau3AI 
CviRI Nspl | Hgal MboII | 

ill i ii 

ATCTTTTATTGCACATGCGGACGCCTTCCCTATTGAACAAGGAGATCTTCTTGGTCATTG 

1141 + + + + + + 1200 

TAGAAAATAACGTGTACGCCTGCGGAAGGGATAACTTGTTCCTCTAGAAGAACCAGTAAC 

Apol 
Tsp509I 
MboII | CviJI 

I I I 
GGTTTTTTATGACGATGAAGGCAAGAAAATTTCTTCCCAGCCTTTCTATGCCCCTTGTCG 

X201 + + + + + + 1260 

CCAAAAAATACTGCTACTTCCGTTCTTTTAAAGAAGGGTCGGAAAGATACGGGGAACAGC 

Af 1III 

BsaJI Maell 
Styl Bbsl | 

Hhal CviJI | MboII j 

I II II 

TTTTGAGCGCACTATCAAGCCTTGGAAACTCTATATGAAACGTGTCTTCACATCGTATAG 

1261 + + + + + + 1320 

AAAACTCGCGTGATAGTTCGGAACCTTTGAGATATACTTTGCACAGAAGTGTAGCATATC 



Nlalll Acil 



Fokl 
Fnu4HI | 
Nlalll) | 
Bbvl Tselj | 

I I I 



Mwol 
SfaNI ] 
Cac8l| | 
Acil | | | 
I II I 



AACCTATATGTCTATAACCATGCTGCTCATGTATTTTCGCATCCGCAAGCACCGCAAGTA 

1221 + + + + + + 1380 

TTGGATATACAGATATTGGTACGACGAGTACATAAAAGCGTAGGCGTTCGTGGCGTTCAT 

Apol 

Tsp509I Dral Tsp509I 
Tthlilll |Msel| Msel | 

i i ii ii 

TAAAAATTTAAAACACTATTCTAAAATCTAACTTTTTCTTTTAATTTATAAAAAACCAAA 

1381 + + + + + + 1440 

ATTTTTAAATTTTGTGATAAGATTTTAGATTGAAAAAGAAAATTAAATATTTTTTGGTTT 

CjePI 

CjePI Hinfl Hhal | 

Hhal 1 Tfil ' Fspl|Mmel| 

ii I M M 

GGTTTATGTAAGATTTGCGCTTTTCAATCCAACAAGAATCCCTTGTGCGCACATTACTTT 

1441 + + + - + + + 1500 

CCAAATACATTCTAAACGCGAAAAGTTAGGTTGTTCTTAGGGAACACGCGTGTAATGAAA 
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aaggagcaaa tggagattgg ccaaatagac gagcaagggt ttgcataaga atagcctttt 60 

tcgcaataat aacttgccta aacgatcttg taaacgactt atg get tct aat ccc 115 

Met Ala Ser Asn Pro 
1 5 

att tta cag ata gag gat eta tec ata acc ttg gca aaa caa cgc caa 163 
lie Leu Gin lie Glu Asp Leu Ser lie Thr Leu Ala Lys Gin Arg Gin 



lie 


Leu 


Gin 


lie 


Glu 
10 


Asp 


Leu 


Ser 


He 


Thr 
15 


Leu 


Ala 


Lys 


Gin 


Arg 
20 


Gin 




cag 
Gin 
Gin 


tac 
Tyr 
Tyr 


ccc 
Pro 
Pro 


ate 
lie 
lie 
25 


gtc 
Val 
Val 


caa 
Gin 
Gin 


tct 
Ser 
Ser 


tta 
Leu 
Leu 


teg 
Ser 
Ser 
30 


ttt 
Phe 
Phe 


act 
Thr 
Thr 


ate 
He 
He 


aat 
Asn 
Asn 


gaa 
Glu 
Glu 
35 


gga 
Gly 
Gly 


caa 
Gin 
Gin 


211 


acc 
Thr 
Thr 


tta 
Leu 
Leu 


gca 
Ala 
Ala 
40 


ate 
He 
He 


att 
He 
He 


gga 
Gly 
Gly 


gaa 
Glu 
Glu 


tea 
Ser 
Ser 
45 


gga 
Gly 
Gly 


tea 
Ser 
Ser 


gga 
Gly 
Gly 


aaa 
Lys 
Lys 


tct 
Ser 
Ser 
50 


gtc 
Val 
Val 


tct 
Ser 
Ser 


gcg 
Ala 
Ala 


259 


cat 
His 
His 


gca 
Ala 
Ala 
55 


ate 
lie 
lie 


ctt 
Leu 
Leu 


cga 
Arg 
Arg 


tta 
Leu 
Leu 


ctt 
Leu 
Leu 
60 


cct 
Pro 
Pro 


tgc 
Cys 
Cys 


ccc 
Pro 
Pro 


cca 
Pro 
Pro 


Phe 
Phe 
65 


tct 
Ser 
Ser 


gtt 
Val 
Val 


tct 
Ser 
Ser 


ggc 
Gly 
Gly 


3C7 


cag 
Gin 
Gin 
70 


gtc 
Val 
Val 


aac 
Asn 
Asn 


ttc 
Phe 
Phe 


caa 
Gin 
Gin 


ggc 
Glv 
Gly 
75 


cac 
His 
His 


aac 
Asn 
Asn 


tta 
Leu 
Leu 


ctt 
Leu 
Leu 


acg 
Thr 
Thr 
80 


get 
Ala 
Ala 


teg 
Ser 
Ser 


cgc 
Arg 
Arg 


tct 
Ser 
Ser 


ata 
He 
lie 
85 


355 


caa 
Gin 
Gin 


aaa 
Lys 
Lys 


aag 
Lys 
Lys 


att 
lie 
He 


ata 
lie 
He 
90 


ggg 

Gly 
Gly 


aca 
Thr 
Thr 


gaa 
Glu 
Glu 


att 
He 
He 


tct 
Ser 
Ser 
95 


atg 
Met 
Met 


ate 
He 
He 


ttt 
Phe 
Phe 


caa 
Gin 
Gin 


aac 
Asn 
Asn 
100 


ccg 
Pro 
Pro 


403 


caa 
Gin 
Gin 


gca 
Ala 
Ala 


tct 
Ser 
Ser 


eta 
Leu 
Leu 
105 


aac 
Asn 
Asn 


ccc 
Pro 
Pro 


gtg 
Val 
Val 


ttt 
Phe 
Phe 


act 
Thr 
Thr 
110 


att 
He 
He 


gaa 
Glu 
Glu 


cag 
Gin 
Gin 


cag 
Gin 
Gin 


ttt 
Phe 
Phe 
115 


cga 
Arg 
Arg 


gaa 
Glu 
Glu 


451 



att 


att 


cat 


acc 


cac 


eta 


gee 


tta 


act 


gca 


gaa 


gtt 


get 


aaa 


gaa 


aag 


499 


He 


He 


His 


Thr 


His 


Leu 


Ala 


Leu 


Thr 


Ala 


Glu 


Val 


Ala 


Lys 


Glu 


Lys 




lie 


He 


His 


Thr 


His 


Leu 


Ala 


Leu 


Thr 


Ala 


Glu 


Val 


Ala 


Lys 


Glu 


Lys 
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atg 


tta 


tac 


get 


ctt 


gaa 


gaa 
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ggg 


ttt 


cat 


gat 
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agg 


ctg 


tgc 
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Met 


Leu 


Tyr 


Ala 


Leu 


Giu 


Giu 


Thr 


Gly 


Phe 
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Pro 
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Leu 


Cys 




Met 


Leu 


Tyr 


Ala 


Leu 


Giu 


Giu 


Thr 


Gly 


Phe 


His 


Aso 


Pro 
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Leu 


Cys 
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140 
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ttg 


aat 


etc 


tac 


ccc 


cac 


caa 


etc 


tct 


gga 


ggg 


atg 


ctt 


caa 


aga 


att 


595 


Leu 


Asn 


Leu 


Tyr 


Pro 


His 


Gin 


Leu 


Ser 


Gly 


Gly 


Met 


Leu 


Gin 
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Leu 
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Leu 


Tyr 


Pro 


His 


Gin 


Leu 


Ser 
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Fig. 13 (con't) 



tgc att gcc atg gcg etc etc tgt tot cct aaa ctt ctt att get gat 64 3 

Cys lie Ala Met Ala Leu Leu Cys Ser Pro Lys Leu Leu lie Ala Asp 

Cys lie Ala Met Ala Leu Leu Cys Ser Pro Lys Leu Leu lie Ala Asp 
170 175 180 



gaa cct acg act get tta gat gtt tet gtt cag tat cag att eta ca , cqi 

Glu Pro Thr Thr Ala Leu Asp Val Ser Val Gin Tyr gIS SS Leu Sn 

Glu Pro Thr Thr Ala Leu Asp Val Ser Val Gin Tyr Gla lie ten S£ 

185 190 195 



240 



245 



cat aat cct tct cat ccc tat acc cga gat ctt tta gca tee aca c— 

His Asn Pro <?<*r- Hi <? P^-o th. a — n „ ?. Cwt - 

Arg 

Arg 

260 



5i« p r ° u 1S n r ° Tyr Thr ^ g As ? Leu Leu Ala Ser £?5 Pro 

Hxs Asn Pro Ser Hxs Pro Tyr Thr Arg Asp Leu Leu Ala Ser Arg Pro 
250 255 * 



275 



Pro Pro His Tyr Thr Ala Phe Pro Ser oVy Cyl £g t£ 2£ ^ £ ? 
280 285 290 

clt tit ?" ?r T tta ^ 5t tCt gCg gaa gct cca ate tat 

Cys Ser Lys lie Leu Asn Arg Cys Ser Ala Glu Ala Pro Glu lie Tyr 

Cys Ser Lys He Leu Asn Arg Cys Ser Ala Glu Ala Pro Glu it! 

Z3b 300 305 

ccg gta cgc gaa ggt cac aaa gta agg gut ggc tgt atg acg ac^ aat 

Pro Val Arg Glu Gly His Lys Val Arg Val Gly Cys Me? Thr a™ 

Pro Val Arg Glu Gly His Lys Val Arg Val Gly «& ?fc j£ 

315 320 325 



739 



tta eta aaa aca eta cag aaa aaa acg gga atg age ctt ctt att at- 
Leu Leu Lys Thr Leu Gin Lys Lys Thr Gly Mel Ser Leu Leu He He 
Leu Leu Lys Thr Leu Gin Lys Lys Thr Gly Met Ser Leu Leu He ill 

acc cat aat atg gga gtc gtt gca gaa act get gat gac gtg etc ata 7fl7 

Thr Hi s Asn Met Gly Val Val Ala Glu Thr Xla Asp Xsp Val LeS Val 

Thr His Asn Met Gly Val Val Ala Glu Thr Ala Aso As? Val Su Vat 
213 220 225 

etc tat gca gga cgc atg gta gaa tgt gcc cct gcg att caa atr, t-rr- otc 
Leu Tyr Ala Gly Arg Met Val Glu Cys lla Pro 111 v" tin Me"? 835 
Leu Tyr Ala Gly Arg Met Val Glu Cys Ala Pro Ala Val Gin Met Phe 



883 



tit t o g Cta g5t tCC ttc aac ccc att cca gga cag 931 

Ser Leu Gin Pro Gin Gin Leu Gly Ser Phe Asn Pro He Pro Gly GlS 

Ser Leu Gin Pro Gin Gin Leu Gly Ser Phe Asn Pro He Pro Gly Gin 

265 270 



ccc cca cac tac acg gee ttt ccc teg gga tgt cgc tat cac cct aga 97 9 
Pro Pro His Tyr Thr Ala Phe Pro Ser Gly Cys Arg Tyr His P-o 25 

Pro Pro His Tvr Thr ai= Dho o o__ - y ~ y " AS r -° ^9 



1027 



1075 



o£ U n CC ^f a CCt tta att Caa gca acc tca tta aca aag cac tat tic 1123 

Phe Pro Gin Pro Leu He -Gin Ala Thr Ser Leu Thr LyI Hif Ty- T J-~ 

Phe Pro Gin Pro Leu lie Gin Ala Thr Ser Leu Thr Lvs Hi! w 
330 335 * 3^5 7 
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Fig. 13 (con't) 

aag cgt tec ttt tgg ttt cag gga aag aca att gec agt cgt cct gtt 1171 

Lys Arg Ser Phe Trp Phe Gin Gly Lys Thr He Ala Ser Arg Pro Val 

Lvs A^-a Se^- Phe Trp Phe Gin Gly Lys Thr He Ala Ser Arg Pro Val 
* 345 350 355 

gac gac gtc tct ttt tea eta tac tec aga cgt get gtc gga ctt att 1219 

Asp Asp Val Ser Phe Ser Leu Tyr Ser Arg Arg Ala Val Gly Leu He 

Aso Asp Val Ser Phe Ser Leu Tyr Ser Arg Arg Ala Val Gly Leu He 

360 365 370 

gga gaa tct gga tea ggg aaa agt ace ctg gcg tta get etc gca ggt 1267 

Gly Glu Ser Gly Ser Gly Lys Ser Thr Leu Ala Leu Ala Leu Ala Gly 

Gly G^u Ser Gly Ser Gly Lys Ser Thr Leu Ala Leu Ala Leu Ala Gly 
375 380 385 

etc eta cct etc acc tct ggg ttc tta act ttt aac ggc ace cca ate 1315 

Leu Leu P-o Leu Thr Ser Gly Phe Leu Thr Phe Asn Gly Thr Pro He 

Leu Leu P-o Leu Thr Ser Gly Phe Leu Thr Phe Asn Gly Thr Pro lie 

395 400 405 



390 



440 



1363 



aag ttg cat tct aaa cac gga cgc cat caa tta cga tct caa gta egg 

Lys Leu H<s Ser Lys His Gly Arg His Gin Leu Arg Ser Gin Val Arg 

Lvs Leu His Ser Lys His Gly Arg His Gin Leu Arg Ser Gin Val Arg 
y 410 415 420 

t^g at- t-" caa aat cea caa get tea tta aac ccg cga aaa act ate 1411 

Leu Val Phe Gin Asn Pro Gin Ala Ser Leu Asn Pro Arg Lys Thr He 

Leu Val Phe Gin Asn Pro Gin Ala Ser Leu Asn Pro Arg Lys Thr He 
425 430 435 

eta gat agt tta ggc cac tct ctg ctt tac cat aaa etc gtc cea aaa 

Leu Asd Ser Leu Gly His Ser Leu Leu Tyr His Lys Leu Val Pro Lys 

Leu Asd Ser Leu Gly His Ser Leu Leu Tyr His Lys Leu Val Pro Lys 
' 445 450 



1459 



1507 



gaa aaa gta eta gca acg gta agg gaa tat tta gaa ttg gta ggg tta 

Glu Lys Val Leu Ala Thr Val Arg Glu Tyr Leu Glu Leu Val Gly Leu 

Glu Lys Val Leu Ala Thr Val Arg Glu Tyr Leu Glu Leu Val Gly Leu 
455 460 465 

tct gag gag tat ttt tat cgt tat cct cac cag ctt tct gga gga caa 1555 

Se-* Glu Glu Tyr Phe Tyr Arg Tyr Pro His Gin Leu Ser Gly Gly Gin 

Se^ Glu Glu Tyr Phe Tyr Arg Tyr Pro His Gin Leu Ser Gly Gly Gin 

470 475 480 485 

caa caa cca gtc tct ata gcg aga gee eta tta gga gtc cct cag tta 1603 

Gin Gin Arg Val Ser He Ala Arg Ala Leu Leu Gly Val Pro Gin Leu 

Gin Gin Arg Val Ser lie Ala Arg Ala Leu Leu Gly Val Pro Gin Leu 

490 495 500 

att att tgt gac gaa att gtt tct get eta gat tta tct att caa gca 1651 

He He Cys Asp Glu He Val Ser Ala Leu Asp Leu Ser He Gin Ala 

ti s He Cvs Aso Glu He Val Ser Ala Leu Asp Leu Ser lie Gin Ala 

- 505 510 515 

caa att ctg aat atg ctt gee gag ctg caa aaa aaa etc age etc aca 1699 

Gin lie Leu Asn Met Leu Ala Glu Leu Gin Lys Lys Leu Ser Leu Thr 

Gin lie Leu Asn Men Leu Ala Glu Leu Gin Lys Lys Leu Ser Leu Thr 

520 525 530 
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Fig. 13 (con't) 

tat etc ttc att teg cat gat ctt gec gtt gta cgc teg ttc tgc aca 1747 

Tyr Leu Phe lie Ser His Asp Leu Ala Val Val Arg Ser Phe Cys Thr 

Tyr Leu Phe lie Ser His Asp Leu Ala Val Val Arg Ser Phe Cys Thr 

535 540 545 

gag gta ttc att atg tat aag ggg caa att gta gaa aaa gga aat aca 1795 

Glu Val Phe lie Met Tyr Lys Gly Gin lie Val Glu Lys Gly Asn Thr 

Glu Val Phe lie Met Tyr Lys Gly Gin lie Val Glu Lys Gly Asn Thr 
550 555 560 565 

aaa cgc att ttt tct gat cca caa cat cct tat acg cgc atg ttg tta 1843 

Lys Arg lie Phe Ser Asp Pro Gin His Pro Tyr Thr Arg Met Leu Leu 

Lys Arg lie Phe Ser Asp Pro Gin His Pro Tyr Thr Arg Met Leu Leu 

570 575 580 

aat gec caa ctt cca gag act cct gat caa agg caa tct aaa cct ata 1891 

Asn Ala Gin Leu Pro Glu Thr Pro Asp Gin Arg Gin Ser Lys Pro lie 

Asn Ala Gin Leu Pro Glu Thr Pro Asp Gin Arg Gin 

585 590 595 

ttc caa gaa tat cac aaa gat tct gaa gaa tct tgc tct aca gga tgc 1939 
Phe Gin Glu Tyr His Lys Asp Ser Glu Glu Ser Cys Ser Thr Gly Cys 
600 605 610 

tac ttt tac aat cgt tgt cca caa aaa caa gaa get tgc aag tea gag 1987 
Tyr Phe Tyr Asn Arg Cys Pro Gin Lys Gin Glu Ala Cys Lys Ser Glu 
615 620 625 

ate ate cca aat caa gga gac gcg cac cat aca tac cgt tgt ate cat 2035 
He He Pro Asn Gin Gly Asp Ala His His Thr Tyr Arg Cys He His 
"0 635 640 645 

tgattegtec tetaegctat tcttaagcta ccattaagga atcccaaggg agaggtctgc 2095 
tctat 2100 
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Figure 14(RY-40) 

Restriction enzyme analysis of CPN 100515 



CviJI 
Hael 
Haelll 

MSCl 

Eael I 



CviJI Mwol 



CviRI 

I I III 
AAGGAGCAAATGGAGATTGGCCAAATAGACGAGCAAGGGTTTGCATAAGAATAGCCTTTT 



TTCCTCGTTTACCTCTAACCGGTTTATCTGCTCGTTCCCAAACGTATTCTTATCGGAAAA 



-+ 60 



Dpnl 
Cjel| 

Sau3AI | | CviJI Cjel 

III I I 

TCGCAATAATAACTTGCCTAAACGATCTTGTAAACGACTTATGGCTTCTAATCCCATTTT 



61 



AGCGTTATTATTGAACGGATTTGCTAGAACATTTGCTGAATACCGAAGATTAGGGTAAAA 



■+ 120 



Dpnl 
Hin4I 
BstYI 
Sau3AI 
HaelV | 
Mnll Hin4I I 
I I 



BstXI 
BsaJI | 
Alwl Styl j 
I 



Bed 
CjePI | 
Rsal | j 
Taal | | | 
II I I 



ACAGATAGAGGATCTATCCATAACCTTGGCAAAACAACGCCAACAGTACCCCATCGTCCA 



121 



TGTCTATCTCCTAGATAGGTATTGGAACCGTTTTGTTGCGGTTGTCATGGGGTAGCAGGT 

Dpnl 
Sau3AI 
Hpyl78III | 
BpulOI Hinf I | j 

Ddel Tf il | j 

I I M 

ATCTTTATCGTTTACTATCAATGAAGGACAAACCTTAGCAATCATTGGAGAATCAGGATC 



-+ 180 



CjePI 



181 



TAGAAATAGCAAATGATAGTTACTTCCTGTTTGGAATCGTTAGTAACCTCTTAGTCCTAG 



■+ 240 



Alwl 

Hpyl78III | 
I 



CviRI 
Nlalll 
Nspl 
SphI 
Cac8I 
Hhal | 
Pspl| | 
BsmAI | | | 

I II I 



TaqI 



Cjel 
I 



AGGAAAATCTGTCTCTGCGCATGCAATCCTTCGATTACTTCCTTGCCCCCCATTTTCTGT 



241 



-+ 300 



TCCTTTTAGACAGAGACGCGTACGTTAGGAAGCTAATGAAGGAACGGGGGGTAAAAGACA 
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Fig. 14 (con't) 



CviJI 
Hael 
Haelll 
BsaJI 
Cjel 
Hindi 



ScrFI 
CviJI 
EcoRII 
Hael 
Haelll 
MscI 
Eael I 



Styl 



Hhal 

CviJI Thai | Bcefl 
III I 



TTCTGGCCAGGTCAACTTCCAAGGCCACAACTTACTTACGGCTTCGCGCTCTATACAAAA 



301 



AAG AC CGGT CC AGTTG AAGGTTCCGGTGTTGAATG AATG C CG AAGCGCG AGAT ATGTTTT 

Faul 

Dpnl Sthl32l| BscGI 

Apol Sau3AI | Cac8I | | Tthlllll 

Tsp509I BsmFI | Acil | | | SfaNI | 

I III I I M II 

AAAGATTATAGGGACAGAAATTTCTATGATCTTTCAAAACCCGCAAGCATCTCTAAACCC 



-+ 360 



361 



TTTCTAATATCCCTGTCTTTAAAGATACTAGAAAGTTTTGGGCGTTCGTAGAGATTTGGG 

CviRI 
Sf cl 
Mwol 
Msel | 
CviJI | | 
Bfal | | | 



-+ 420 



XmnI 
Tsp509I | 
Hpyl78III | | 
Sthl32I TaqI | j j 

I Mil 

CGTGTTTACTATTGAACAGCAGTTTCGAGAAATTATTCATACCCACCTAGCCTTAACTGC 



421 



GCACAAATGATAACTTGTCGTCAAAGCTCTTTAATAAGTATGGGTGGATCGGAATTGACG 



-+ 480 



ScrFI 
BsaJI 
EcoRII 
Tthlllll 

Dpnl 
Nlalll 
Sau3AI 
Hpyl78III | 

BstAPI RcaI I I 

MWOl AlwI I I ! 

PstI I Hpyl78III MboII | | 

II i ill 

AGAAGTTGCTAAAGAAAAGATGTTATACGCTCTTGAAGAAACAGGGTTTCATGATCCCAG 

481 + + + + + + 540 

TCTTCAACGATTTCTTTTCTACAATATGCGAGAACTTCTTTGTCCCAAAGTACTAGGGTC 
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Fig. 14 (con't) 



BsrDI 
CviRI 

Hpyl78III Bpml 

BstXI | Fokl 

CjePI SfaNI | | Apol| 

Hinfl | Mill 1 1 | | Tsp509l| 

CviJI Tfil | Xcmlj j j CjePI || 

| II II I I IN 

GCTGTGCTTGAATCTCTACCCCCACCAACTCTCTGGAGGGATGCTTCAAAGAATTTGCAT 

. , + + + + 600 



541 



CGACACGAACTTAGAGATGGGGGTGGTTGAGAGACCTCCCTACGAAGTTTCTTAAACGTA 



Haell 
Hhal 
Nlalll 
BsaJI 
BstDSI 
Ncol 
Styl 
BseRI | 
MslI j 



I 



601 



Mnll Pflll08I 

I I 

TGCCATGGCGCTCCTCTGTTCTCCTAAACTTCTTATTGCTGATGAACCTACGACTGCTTT 

+ + + _ + + + 660 

ACGGTACCGCGAGGAGACAAGAGGATTTGAAGAATAACGACTACTTGGATGCTGACGAAA 



Hinfl 
Tfil 

Hpyl88IX |Tsp509I 



Sthl32I 
Sf cl | 



BscGI 



661 



AGATGTTTCTGTTCAGTATCAGATTCTACAATTACTAAAAACACTACAGAAAAAAACGGG 

+ + + + + + 720 

TCTACAAAGACAAGTCATAGTCTAAGATGTTAATGATTTTTGTGATGTCTTTTTTTGCCC 



AlwNl 

Hinfl BstAPI 
BslI | CviRI | 

CviJI PflMI | Plel Mwol Maell 

I I I I I I 

AATGAGCCTTCTTATTATTACCCATAATATGGGAGTCGTTGCAGAAACTGCTGATGACGT 



721 



■+ 780 



TTACTCGGAAGAATAATAATGGGTATTATACCCTCAGCAACGTCTTTGACGACTACTGCA 



BsiHKAI 
Bspl286I 
BsiHKAI | 
Bspl286I j 

BssSl| j CviRI 

II 



Bspl286I 
Hgal Bmgl | 
Nlalll | BseSI |AciI CjePI 
I I 



Fokl 



781 



GCTCGTGCTCTATGCAGGACGCATGGTAGAATGTGCCCCTGCGGTTCAAATGTTCCATAA 

+ + + + + + 840 

CGAGCACGAGATACGTCCTGCGTACCATCTTACACGGGGACGCCAAGTTTACAAGGTATT 
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Fig. 14 (con't) 



CjePI 
Sthl32I 
Dpnl 
Bglll 
BstYI 
Sau3AI 
CjePI | 
Fokl | j 
Aval | | | 

TCCTTCTCATCCCTATACCCGAGATCTTTTAGCATCCAGACCCTCTCTACAACCGCAACA 



SfaNI 
SimI | 
Hpyl78IIl| | 



Acil 
Mnll | 



841 



- + 900 



AGGAAGAGTAGGGATATGGGCTCTAGAAAATCGTAGGTCTGGGAGAGATGTTGGCGTTGT 



Beef I 
Hpyl78III 

Aval 
BsaJI 



Bfal 
CjePI |NlaIV 



901 



RleAI 
Sthl32I | 
Cvi JI | | 
Haelll | 



ScrFl 
BslI | 
ECORII I j CviJI 

III I 'I 1 

ACTAGGTTCCTTCAACCCCATTCCAGGACAGCCCCCACACTACACGGCCTTTCCCTCGGG 

+ + + + + + 960 

TGATCCAAGGAAGTTGGGGTAAGGTCCTGTCGGGGGTGTGATGTGCCGGAAAGGGAGCCC 



SfaNI 
Mnll | Bfal 
HphI | | Fokl | 
I I I I I 



Bpml 
Clal | 
Apol Dral | j 
Tsp509I Msel|TaqI j 
I II I I 



Hpyl78III 
Alul | 
Acil CviJI | 
I I I 



961 



ATGTCGCTATCACCCTAGATGCTCAAAAATTTTAAATCGATGTTCTGCGGAAGCTCCAGA 

^ ^ + + + + 1020 



TAi 



CAGCGATAGTGGGATCTACGAGTTTTTAAAATTTAGCTACAAGACGCCTTCGAGGTCT 



Maelll 
Tsp45I 
Mspl Thai | 
BsaWllRsal I 



CviJI 



Tsp509I 



AAT 



CTATCCGGTACGCGAAGGTCACAAAGTAAGGGTTGGCTGTATGACGACTAATTTTCC 



1021 



1080 



TTAGATAGGCCATGCGCTTCCAGTGTTTCATTCCCAACCGACATACTGCTGATTAAAAGG 



Tsp509I 
Msel | 



Mnll 
Tthlllll) 
Msel | j 



1081 



CCAACCTTTAATTCAAGCAACCTCATTAACAAAGCACTATTACAAGCGTTCCTTTTGGTT 

+ + + + + + 1140 

GGTTGGAAATTAAGTTCGTTGGAGTAATTGTTTCGTGATAATGTTCGCAAGGAAAACCAA 
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Fig. 14 (con't) 



Muni 
Tsp509I BsrI 

I 



BsmAI 
BsmBI 
Bpml 
Aatll 
BsaHI 
Hin4I 
Maell 
Tthllll | 
Hindi | | 
II 



Hpyl78III 



Mmel 



TCAGGGAAAGACAATTGCCAGTCGTCCTGTTGACGACGTCTCTTTTTCACTATACTCCAG 



1141 



-+ 1200 



AGTCCCTTTCTGTTAACGGTCAGCAGGACAACTGCTGCAGAGAAAAAGTGATATGAGGTC 



Hpyl8 8IX 
Ahdl 
Haeiv) 
Hin4I j 
Mae I I | | 
I If 



Dpnl 
Sau3AI 
Hpyl78III | 
Hinf I | j 

Tf il | j 

I I 



ScrFI 
BsaJI | Alul 
EcoRII | CviJI 
Alwl Rsal | j BspMI | 

I Ml II 



ACGTGCTGTCGGACTTATTGGAGAATCTGGATCAGGGAAAAGTACCCTGGCGTTAGCTCT 



1201 



TGCACGACAGCCTGAATAACCTCTTAGACCTAGTCCCTTTTCATGGGACCGCAATCGAGA 



-+ 1260 



Bsal 

BsmAI Msel NlalV 

Hphll Mnl I Mnll| Msel Banl | Beef I 

II I II I II I 

CGCAGGTCTCCTACCTCTCACCTCTGGGTTCTTAACTTTTAACGGCACCCCAATCAAGTT 



1261 



-+ 1320 



GCGTCCAGAGGATGGAGAGTGGAGACCCAAGAATTGAAAATTGCCGTGGGGTTAGTTCAA 



Smll 

Dpnl 

Sau3AI 

Hgal | 

Tsp509l| | 

Bed | j | 

Bsml Bce83I III III Taal 

CviRI BsaHI || || III Rsal 

I I II II I 

GCATTCTAAACACGGACGCCATCAATTACGATCTCAAGTACGGTTGGTCTTTCAAAATCC 



1321 



CGTAAGATTTGTGCCTGCGGTAGTTAATGCTAGAGTTCATGCCAACCAGAAAGTTTTAGG 



-+ 1380 



Faul 

Sthl32l| CviJI 

Thai | j Hael 

Msel Acil | j j Bf al Haelll BsmFI 

I II II I I I 

ACAAGCTTCATTAAACCCGCGAAAAACTATCCTAGATAGTTTAGGCCACTCTCTGCTTTA 

1381 + + + + + + 1440 

TGTTCGAAGTAATTTGGGCGCTTTTTGATAGGATCTATCAAATCCGGTGAGAGACGAAAT 



Alul 
CviJI 
Hindi I I | 
I I 
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Fig. 14 (con't) 



Bf al 
Rsal | 

Seal | Tsp509I 
TatI | j Taal Sspl | BseMII 

III I II I 

CCATAAACTCGTCCCAAAAGAAAAAGTACTAGCAACGGTAAGGGAATATTTAGAATTGGT 



1441 



1500 



GGTATTTGAGCAGGGTTTTCTTTTTCATGATCGTTGCCATTCCCTTATAAATCTTAACCA 



1501 



Hpyl78III 
BstXI 
Mnll 

Ddel Mnll | 

Hpyl88IX BseRI Alul | j 

Mnll | HphI CviJI j j 

II I Ml 

AGGGTTATCTGAGGAGTATTTTTATCGTTATCCTCACCAGCTTTCTGGAGGACAACAACA 

+ + + + + + 1560 

TCCCAATAGACTCCTCATAAAAATAGCAATAGGAGTGGTCGAAAGACCTCCTGTTGTTGT 



Plel 
BsmAI 
Sfcl | 
Bpml | j 
Hinf I | I j 
I I I 



BseMII 
Maelll 
Tsp4 5I 
Mnll 



Banll 
Bspl286I 
CviJI | 
BsmFI I 



Tsp509I 
Msel | 
Plel | j 
Hinf I Ddel | j j 
I I I 



Tsp509I 



ACGAGTCTCTATAGCGAGAGCCCTATTAGGAGTCCCTCAGTTAATTATTTGTGACGAAAT 



1561 



-+ 1620 



TGCTCAGAGATATCGCTCTCGGGATAATCCTCAGGGAGTCAATTAATAAACACTGCTTTA 



Bbvl 
Tthlllll 
Acelll 
Hpyl88IX 
Tthlllll | 
Apol | | 
Tsp509I | | 



Hpyl78III 
Bf al | 

Xball Tsp509I | | || Cac8I | | | 

Ml I I I II I I M 

TGTTTCTGCTCTAGATTTATCTATTCAAGCACAAATTCTGAATATGCTTGCCGAGCTGCA 

1621 + + + + + + ^80 

ACAAAGACGAGATCTAAATAGATAAGTTCGTGTTTAAGACTTATACGAACGGCTCGACGT 



CviRI 
Fnu4HI 
Alul | 
CviJI | 
Tsel | 
Mwol I 



MboII Dpnl 
CviJI | Mnll Beef I Nlalll] 
Ddel | | BseMII | Earl | Sau3AI | | Bsgl Rsal Mwol 

I I t II II IN II I 

AAAAAAACTCAGCCTCACATATCTCTTCATTTCGCATGATCTTGCCGTTGTACGCTCGTT 



1681 



-+ 1740 



TTTTTTTGAGTCGGAGTGTATAGAGAAGTAAAGCGTACTAGAACGGCAACATGCGAGCAA 
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Fig. 14 (con't) 



CviRI 

Mnll I Tsp509I 
I I I 

CTGCACAGAGGTATTCATTATGTATAAGGGGCAAATTGTAGAAAAAGGAAATACAAAACG 



1741 



■+ 1800 



GACGTGTCTCCATAAGTAATACATATTCCCCGTTTAACATCTTTTTCCTTTATGTTTTGC 



Dpnl 
Sau3AI 
Hpyl88IX | 
Fokl | I 
Alwl | j I 
II II 



Nlalll 
Hhal |MseI 
Thai |NspI | 
I I I I 



Hpyl78III 
Plel | 
BsmAI | | 
I I I 



CATTTTTTCTGATCCACAACATCCTTATACGCGCATGTTGTTAAATGCCCAACTTCCAGA 



1801 



-+ 1860 



GTAAAAAAGACTAGGTGTTGTAGGAATATGCGCGTACAACAATTTACGGGTTGAAGGTCT 



Dpnl 
Bell 
Sau3Al 
Hpyl78III | 

HaeIV|| | Hpyl8 8IX 

Hin4l|| | Hinfl |HinfI 

Hinf I Ml | Tfil | Tf il 

I III I III 
GACTC CTGATC AAAGGCAATCTAAACCTATATTC C AAGAATAT CACAAAGATT CTGAAG A 

1861 + + + + + + 1920 

CTGAGGACTAGTTTCCGTTAGATTTGGATATAAGGTTCTTATAGTGTTTCTAAGACTTCT 

CviRI 
Fokl 
Cac8I 

Sfcl Alul | 

MboIl| CviJI | 

SfaNI ||Eco57I Fokl Hindlll | | 

III I M I 

ATCTTGCTCTACAGGATGCTACTTTTACAATCGTTGTCCACAAAAACAAGAAGCTTGCAA 



1921 



-+ 1980 



TAGAACGAGATGTCCTACGATGAAAATGTTAGCAACAGGTGTTTTTGTTCTTCGAACGTT 



Dpnl BslI Bael BciVI 

Sau3AI | BsmAI | Hhal Hinfl | 

Hpyl88IX | | BsmBI j Thai |HgaI Taal Tfil | 

III ' II I I I I M 

GTC AGAG AT CAT C CCAAATCAAGGAG ACG CG C AC CATAC ATAC CGTTGTATC CATTG ATT 



1981 



-+ 2040 



CAGTCTCTAGTAGGGTTTAGTTCCTCTGCGCGTGGTATGTATGGCAACATAGGTAACTAA 
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Fig. 14 (con't) 



Bael 
Hin4I | 
I 



Alul 
CviJI 
Msel 
Af III | 
Smll | 
Mnll | | 
I I I 



Mnll 
BsaJl | 
Hinf I | j 
Msel Tfil Stylj 
I 



Hin4I 



CGTCCTCTACGCTATTCTTAAGCTACCATTAAGGAATCCCAAGGGAGAGGTCTGCTCTAT 



2041 



-+ 2100 



GCAGGAGATGCGATAAGAATTCGATGGTAATTCCTTAGGGTTCCCTCTCCAGACGAGATA 
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Figure 15: 

cgaagagcaa acctccacag ttacagagaa agacgtccaa cctaaaacac aagcaacacc 60 

acacgcttcg aagaaaaacg ttgcaagtcc ttcgacctct atg cca gga ate gag 115 

Met Pro Gly lie Glu . 
1 5 



aaa gca gca aca aca gtg get gta cct caa gac aaa tct gaa gaa gaa 
Lys Ala Ala Thr Thr Val Ala Val Pro Gin Asp Lys Ser Glu Glu Glu 
10 15 20 

aaa gtt aaa gag cga ttg aca aag egg gaa ctt ace tgt gaa gac ctt 
Lys Val Lys Glu Arg Leu Thr Lys Arg Glu Leu Thr Cys Glu Asp Leu 
25 30 35 

aaa gat aac ggc tat act gtc aat ttt gaa gac att tct att tta gag 
Lys Asp Asn Gly Tyr Thr Val Asn Phe Glu Asp He Ser He Leu Glu 
40 45 50 

ttg ttg cag ttc gta agt aaa att tct gga acg aac ttt gtc ttt gat 
Leu Leu Gin Phe Val Ser Lys He Ser Gly Thr Asn Phe Val Phe Asp 
55 60 65 

age aac gat ttg caa ttc aat gtc acg ate gtt tec cac gat cct act 
Ser Asn Asp Leu Gin Phe Asn Val Thr He Val Ser His Asp Pro Thr 
70 75 80 85 

tct gta gat gat tta tct aca ate tta eta caa gtc tta aaa atg cat 
Ser Val Asp Asp Leu Ser Thr He Leu Leu Gin Val Leu Lys Met His 
90 95 100 

gac ttg aag gtt gtt gaa caa ggc aat aac gtc ctt ate tat cgt aat 451 
Asp Leu Lys Val Val Glu Gin Gly Asn Asn Val Leu lie Tyr Arg Asn 
105 HO H5 



cct cat ctt tct aag eta tec aca gta gtc aca gac age tec tta aaa 
Pro His Leu Ser Lys Leu Ser Thr Val Val Thr Asp Ser Ser Leu Lys 
120 125 130 

gaa acg tgt gaa get gtt gtg gtt ace cga gtg ttc cgt ctt tac agg 
Glu Thr Cys Glu Ala Val Val Val Thr Arg Val Phe Arg Leu Tyr Arg 
135 140 145 

cgt cag ccc tct gca gca gta aat att att caa cct tta ctt tec cat 
Arg Gin Pro Ser Ala Ala Val Asn He He Gin Pro Leu Leu Ser His 
150 155 160 165 

gat get ate gtt agt get tea gaa get act cgt cat gtt ate ate teg 
Asp Ala He Val Ser Ala Ser Glu Ala Thr Arg His Val He He Ser 
170 175 180 

gat att get ggt aat gtc gat aaa gtc agt gat ttg eta gca get eta 
Asp He Ala Gly Asn Val Asp Lys Val Ser Asp Leu Leu Ala Ala Leu 
185 190 195 



163 



211 



259 



307 



355 



403 



499 



547 



595 



643 



691 
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gat tgc cca ggc aca tct gtg gac atg act gaa tac gaa gtt aaa tat 739 

Asp Cys Pro Gly Thr Ser Val Asp Met Thr Glu Tyr Glu Val Lys Tyr 

200 205 210 

gcc aat ccc gca get ctt gtt age tac tgc caa gat gtt ctt ggt act 
Ala Asn Pro Ala Ala Leu Val Ser Tyr Cys Gin Asp Val Leu Gly Thr 
215 220 225 

ctg gcc gaa gat gat get ttc caa atg ttc ate caa cct gga acg aac 
Leu Ala Glu Asp Asp Ala Phe Gin Met Phe lie Gin Pro Gly Thr Asn 
230 235 240 245 

aaa att ttc gtc gtc tct tea cca cgt ctt gca aat aag gca gag cag 
Lys lie Phe Val Val Ser Ser Pro Arg Leu Ala Asn Lys Ala Glu Gin 
250 255 260 

etc ctg aag tec tta gat gtc cca gaa atg gca cat acc eta gat gat 
Leu Leu Lys Ser Leu Asp Val Pro Glu Met Ala His Thr Leu Asp Asp 
265 270 275 

cct gca agt act gcc ttg get ttg gga gga aca gga acc acg age cct 
Pro Ala Ser Thr Ala Leu Ala Leu Gly Gly Thr Gly Thr Thr Ser Pro 
280 285 290 

aag agt ttg egg ttc ttt atg tac aag ctg aag tat caa aat gga gaa 
Lys Ser Leu Arg Phe Phe Met Tyr Lys Leu Lys Tyr Gin Asn Gly Glu 
295 300 305 

gtg att get aat gcc etc caa gat ate ggt tac aat eta tat gta acc 
Val lie Ala Asn Ala Leu Gin Asp lie Gly Tyr Asn Leu Tyr Val Thr 
310 315 320 325 

aca get atg gac gaa gat ttc att aac act etc aat agt ate cag tgg 1123 
Thr Ala Met Asp Glu Asp Phe lie Asn Thr Leu Asn Ser lie Gin Trp 
330 335 340 

tta gag gtc aat aac tec ata gtt att ate gga aac caa ggg aat gtc 1171 
Leu Glu Val Asn Asn Ser lie Val He He Gly Asn Gin Gly Asn Val 
345 350 355 

gac aga gtt att ggc etc tta aac ggt tta gat tta cct cct aaa cag 1219 
Asp Arg Val He Gly Leu Leu Asn Gly Leu Asp Leu Pro Pro Lys Gin 
360 365 370 



787 



835 



883 



931 



979 



1027 



1075 



gtt tac ate gaa gtt tta att eta gat acc age tta gag aaa tec tgg 1267 

Val Tyr He Glu Val Leu lie Leu Asp Thr Ser Leu Glu Lys Ser Trp 

375 380 385 

gac ttt gga gtg caa tgg gta gcc eta ggt gat gaa caa agt aaa gta 1315 

Asp Phe Gly Val Gin Trp Val Ala Leu Gly Asp Glu Gin Ser Lys Val 

390 395 400 405 
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Fig. 15 con't) 

get tat get 
Ala Tyr Ala 



aaa gca act 
Lys Ala Thr 



cct acg cca 
Pro Thr Pro 
440 

tea gca ttc 
Ser Ala Phe 
455 

aag tct ttc 
Lys Ser Phe 
470 

gga gat act 
Gly Asp Thr 



caa caa get 
Gin Gin Ala 



aaa tac tat 
Lys Tyr Tyr 
520 

gaa gat att 
Glu Asp lie 
535 

aat gta gtt 
Asn Val Val 
550 

gcg tct gga 
Ala Ser Gly 



cgc tta caa 
Arg Leu Gin 



aga gat aaa 
Arg Asp Lys 
600 



tct gga eta 
Ser Gly Leu 
410 

gtc cct ccc 
Val Pro Pro 
425 

gga caa ttg 
Gly Gin Leu 



ggt eta gga 
Gly Leu Gly 



ctt act ttg 
Leu Thr Leu 
475 

gtc att gtc 
Val lie Val 
490 

teg ttt ttt 
Ser Phe Phe 
505 

ate caa gaa 
lie Gin Glu 



gga gtg aac 
Gly Val Asn 



aca eta caa 
Thr Leu Gin 
555 

tea eta aca 
Ser Leu Thr 
570 

att ccc gac 
lie Pro Asp 
585 

act aca aaa 
Thr Thr Lys 



ttg aat aat 
Leu Asn Asn 



ggc acg cca 
Gly Thr Pro 
430 

aca ggg ttc 
Thr Gly Phe 
445 

ate ate gga 
lie He Gly 
460 

gga ggc tta 
Gly Gly Leu 



ttg aat cct 
Leu Asn Pro 



gta ggg caa 
Val Gly Gin 
510 

aca gga act 
Thr Gly Thr 
525 

ctt gtc gtt 
Leu Val Val 
540 

ate gaa cag 
He Glu Gin 



cct gtc aca 
Pro Val Thr 



ggt tgt ttc 
Gly Cys Phe 
590 

gtg gtt tea 
Val Val Ser 
605 



act ggc ata 
Thr Gly He 
415 

aat cct ggt 
Asn Pro Gly 



tea gat atg 
Ser Asp Met 



aat gtc eta 
Asn Val Leu 
465 

tta agt gee 
Leu Ser Ala 
480 

aga ate atg 
Arg He Met 
495 

acg gtc cct 
Thr Val Pro 



gta acg caa 
Val Thr Gin 



acc tct aca 
Thr Ser Thr 
545 

acg ate tea 
Thr He Ser 
560 

gat aaa act 
Asp Lys Thr 
575 

tta gtt atg 
Leu Val Met 



gga gtg cct 
Gly Val Pro 



gee aca cct 
Ala Thr Pro 
420 

teg ate cct 
Ser lie Pro 
435 

ctg aac tct 
Leu Asn Ser 
450 

agt cat aaa 
Ser His Lys 



tta gat caa 
Leu Asp Gin 



get cag gat 
Ala Gin Asp 
500 

tac caa act 
Tyr Gin Thr 
515 

aat ate gat 
Asn He Asp 
530 

gtt get ccc 
Val Ala Pro 



gaa tta cat 
Glu Leu His 



tat gca gee 
Tyr Ala Ala 
580 

agt ggg cat 
Ser Gly His 
595 

ttg eta aac 
Leu Leu Asn 
610 



aca 1363 
Thr 



Ctt 1411 
Leu 



teg 1459 
Ser 



ggg 1507 
Gly 



gat 1555 

Asp 

485 

acg 1603 
Thr 



ate 1651 
He 



tat 1699 
Tyr 



aac 1747 
Asn 



tec 1795 

Ser 

565 

aca 1843 
Thr 



ate 1891 
lie 



tec 1939 
Ser 
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ata cca tta att cgt ggt tta ttt age cgt acc ate gac caa agg caa 1987 

He Pro Leu He Arg Gly Leu Phe Ser Arg Thr He Asp Gin Arg Gin 

615 620 625 

aaa cgc aat ate atg atg ttt att aag cct aag gtg att agt age ttt 
Lys Arg Asn He Met Met Phe He Lys Pro Lys Val He Ser Ser Phe 
630 635 640 645 

gaa gaa ggc act cgt gtt acc aat aag gaa gga tac aga tac aat tgg 
Glu Glu Gly Thr Arg Val Thr Asn Lys Glu Gly Tyr Arg Tyr Asn Trp 
650 655 660 

gaa get gat gaa gga tec atg caa gtg gee cct cgc cat get cct gaa 
Glu Ala Asp Glu Gly Ser Met Gin Val Ala Pro Arg His Ala Pro Glu 
665 670 675 

tgc caa gga cct cct tct tta cag get gaa agt gac ttt aaa ata ata 2179 
Cys Gin Gly Pro Pro Ser Leu Gin Ala Glu Ser Asp Phe Lys He He 
680 685 690 

gaa ata gaa get cag tagtggtata taaaagagga agatgatatt ctccgccgtg 2234 
Glu He Glu Ala Gin 
695 

gaatagcttc tgactctgtt gcattcaggg ggaaagccaa gaagatgtag agtcggccgt 22 94 
ataact 2300 



2035 



2083 



2131 
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Figure 16(RY-41) 

Restriction enzyme analysis of CPN1 00538 

Mnll Aatll 
Maelll | BsaHI | 

Mboll | | Maell j 

Hin4I | Taal j BplI | j Bsbl 

I I I I I I I I 

CGAAGAGCAAACCTCCACAGTTACAGAGAAAGACGTCCAACCTAAAACACAAGCAACACC 

1 + + + + + + 60 

GCTTCTCGTTTGGAGGTGTCAATGTCTCTTTCTGCAGGTTGGATTTTGTGTTCGTTGTGG 



Fnu4HI 
Tsel 
Bcgl 
Hpyl78III 
TaqI 



CviRI 

NspV Bcgl | 

TaqI Mbollj 
Tthlllll | Acll | | 

Mmel | j Bcgl Maell j j 

I I I I III 



TaqI 



Hinf I 
Tf il 
Bcgl 
Mnll 
ScrFI | 
EcoRII | j 



61 



ACACGCTTCGAAGAAAAACGTTGCAAGTCCTTCGACCTCTATGCCAGGAATCGAGAAAGC 

+ + + + + + 

TGTGCGAAGCTTCTTTTTGCAACGTTCAGGAAGCTGGAGATACGGTCCTTAGCTCTTTCG 



120 



Hpyl78III 



Smll 
Rsal 
TspRI 
AlwNI 



CviJI 
Bbvl | 
Taal | j 



Bce83I 



Hpyl8 8IX 
Mnll I 



Faul 
Sthl32I 
Eco57I 
Mboll | 
Msel | | 
Mboll j j | 



121 



I II II I ! I II II I I 

AGCAACAACAGTGGCTGTACCTCAAGACAAATCTGAAGAAGAAAAAGTTAAAGAGCGATT 

, + + + h 

TCGTTGTTGTCACCGACATGGAGTTCTGTTTAGACTTCTTCTTTTTCAATTTCTCGCTAA 



180 



181 



Mboll Bcefl 
Bbsl | Tsp509I | 

Acil Msel | CviJI Taal | | 

| I I I I I I 

GACAAAGCGGGAACTTACCTGTGAAGACCTTAAAGATAACGGCTATACTGTCAATTTTGA 

+ + + + + + 240 

CTGTTTCGCCCTTGAATGGACACTTCTGGAATTTCTATTGCCGATATGACAGTTAAAACT 
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Fig. 16 (con't) 



Hpyl78III 

MboII Apol | 

Bbsl I CviRI Tsp50 9I | 

II i ii 

AGACATTTCTATTTTAGAGTTGTTGCAGTTCGTAAGTAAAATTTCTGGAACGAACTTTGT 

241 + + + + + + 300 

TCTGTAAAGATAAAATCTCAACAACGTCAAGCATTCATTTTAAAGACCTTGCTTGAAACA 



BsiEI 
Pvul 
Dpnl ! 
Sau3AI 
Hpyl78III | 
Tsp509I Maelll | | 
CviRI | Tsp45I | | 

II I I I 

CTTTGATAGCAACGATTTGCAATTCAATGTCACGATCGTTTCCCACGATCCTACTTCTGT 

301 + + + - + + + 360 

GAAACTATCGTTGCTAAACGTTAAGTTACAGTGCTAGCAAAGGGTGCTAGGATGAAGACA 



Dpnl 
Sau3AI | 
Alwl | | 

I I I 



Sfcl 



Nlalll 

HaelV Nsil | 

Hin4I Msel CviRI | | 

I I I I I 

AGATGATTTATCTACAATCTTACTACAAGTCTTAAAAATGCATGACTTGAAGGTTGTTGA 

361 + + + + + + 420 

TCTACTAAATAGATGTTAGAATGATGTTCAGAATTTTTACGTACTGAACTTCCAACAACT 



Alul 

CviJl Maelll 

Ddel | Tsp45I 

Maell Mnl I | Taal | 

| I I I I 

ACAAGGCAATAACGTCCTTATCTATCGTAATCCTCATCTTTCTAAGCTATCCACAGTAGT 

421 + + + + + + 480 

TGTTCCGTTATTGCAGGAATAGATAGCATTAGGAGTAGAAAGATTCGATAGGTGTCATCA 



Af 1III 
Mae 1 1 

Alul Acelll | Alul BstEII Sthl32I 

CviJI Msel | | CviJI Maelll Aval |HgaI 

I I I I I II II 

CACAGACAGCTCCTTAAAAGAAACGTGTGAAGCTGTTGTGGTTACCCGAGTGTTCCGTCT 

48 1 + „ + + + + + 540 

GTGTCTGTCGAGGAATTTTCTTTGCACACTTCGACAACACCAATGGGCTCACAAGGCAGA 
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Fig. 16 (con't) 



Mnll 
PstI 
Fnu4HI 
CviRI | _ 

Tsel | j j Bbvl EC057I 
BsaHI CviJI Sfcl ||| | Sspl | SfaNI Nlalll | 

I I MM I II I 1 1 

TTACAGGCGTCAGCCCTCTGCAGCAGTAAATATTATTCAACCTTTACTTTCCCATGATGC 

541 + + + + + + 600 

AATGTCCGCAGTCGGGAGACGTCGTCATTTATAATAAGTTGGAAATGAAAGGGTACTACG 



Alul 
CviJI 
Mwol | 

Hpyl88IX | j Nlalll Hpyl88IX TaqI 

II I I I I 

TATCGTTAGTGCTTCAGAAGCTACTCGTCATGTTATCATCTCGGATATTGCTGGTAATGT 

601 + + + + + + 660 

ATAGCAATCACGAAGTCTTCGATGAGCAGTACAATAGTAGAGCCTATAACGACCATTACA 



Hpyl78III 
Bfal 
Xbal 
Alul 
CviJI 
Fnu4HI 



Tsel 
Cac8I 
Bfal | 
Nhel | j 
TspRI | | | 
II 



ScrFI 
BsaJI 
EcoRII 
Acelll | 
Bbvl | j 

II 



MslI 



Nlalll 



661 



CGATAAAGTCAGTGATTTGCTAGCAGCTCTAGATTGCCCAGGCACATCTGTGGACATGAC 

h + + + + + 

GCTATTTCAGTCACTAAACGATCGTCGAGATCTAACGGGTCCGTGTAGACACCTGTACTG 



720 



Faul 
Sthl32I [ 
Alul | | 
CviJI j [ 
Fnu4HI | | | Acelll 
Tsel | III Alul 
Acil || Ml CviJI 
Msel Mwol | || III Bbvl | Xcml 

I II II III II I 

TGAATACGAAGTTAAATATGCCAATCCCGCAGCTCTTGTTAGCTACTGCCAAGATGTTCT 

721 + + + + + + 780 

ACTTATGCTTCAATTTATACGGTTAGGGCGTCGAGAACAATCGATGACGGTTCTACAAGA 



67/165 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



PCT/CA99/00992 



Fig. 16 (con't) 



CviJI 
Haelll 
Sf aNI | 
Rsal Eael | | 
Bcgl |GdiII | | 
II 



ScrFI 
Bcgl EcoRII | 



Apol 
Tsp509I 



MboII 
Fokl | 

II III I 

TGGTACTCTGGCCGAAGATGATGCTTTCCAAATGTTCATCCAACCTGGAACGAACAAAAT 

781 + + + + + + 840 

ACCATGAGACCGGCTTCTACTACGAAAGGTTTACAAGTAGGTTGGACCTTGCTTGTTTTA 



HphI 
Mmel | 
MboII | | 
II I 



Mae 1 1 
Earl | 
BsmAI | j 
BsmBI | | 



BsmFI 
Hpyl7 8III 
AlwNI 
Alul 

CviJI 
Fnu4HI | 
Tsel| | 
II I 



Ddel 
Acelll | 
Bbvl | | 

II I 



CviRI Mwol 
I I I 

TTTCGTCGTCTCTTCACCACGTCTTGCAAATAAGCCAGAGCAGCTCCTGAAGTCCTTAGA 

841 + + + + + + 900 

AAAGCAGCAGAGAAGTGGTGCAGAACGTTTATTCCGTCTCGTCGAGGACTTCAGGAATCT 



BsaJI 
BstAPI 
MWOl 

Dpnl Rsal | 

Sau3AI | Seal | 

BslI Alwl | | Tat I || | CviJI 

Eco57I | Bfal | | CviRI | j | Sty I Mnll 

II I I I I III I I 

TGTCCCAGAAATGGCACATACCCTAGATGATCCTGCAAGTACTGCCTTGGCTTTGGGAGG 

901 + + + + + + 960 

ACAGGGTCTTTACCGTGTATGGGATCTACTAGGACGTTCATGACGGAACCGAAACCCTCC 



961 



Ddel 
Banll 
Bspl286I 

BssSI I I Rsal 

Drdll | j | BsrGI | Alul 

NlaIV| |CviJI jj Acil TatI j CviJI 

III III I I I I 

AACAGGAACCACGAGCCCTAAGAGTTTGCGGTTCTTTATGTACAAGCTGAAGTATCAAAA 

+ + + + + + 

TTGTCCTTGGTGCTCGGGATTCTCAAACGCCAAGAAATACATGTTCGACTTCATAGTTTT 



1020 
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Fig. 16 (con't) 

Maelll Alul 

Mnll | CviJI 

Eco57I EcoRV| | Maelll MslI 

I II I II 

TGGAGAAGTGATTGCTAATGCCCTCCAAGATATCGGTTACAATCTATATGTAACCACAGC 

102 1 + + + + + + 1080 

ACCTCTTCACTAACGATTACGGGAGGTTCTATAGCCAATGTTAGATATACATTGGTGTCG 
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BstXI Msel | Bsrl | j 

I II III 
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108 1 + + + + + + H40 

ATACCTGCTTCTAAAGTAATTGTGAGAGTTATCATAGGTCACCAATCTCCAGTTATTGAG 
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I I II Ml I II 
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Figure 17: CPN100557 

tagcttgaaa tagcttcctc caattgtgat ttctgaagaa gtataggggg aaatgtcgaa 60 



gagatagtct tgttttaaag gaggagggga aaacggttta atg age aga aaa gac 115 
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400 





cgt 


tgg 


gtt 


tta 


Arg 


Trp 


Val 


Leu 


Arg 


Trp 


Val 


Leu 








405 



aaa 


gac 


gtc 


teg 


ggc 


att 


tec 


tat 


get 


aca 


tec 


ata 


act 


get 


tgg 


gtg 


Lys 


Asp 


Val 


Ser 


Gly 


He 


Ser 


Tyr 


Ala 


Thr 


Ser 


He 


Thr 


Ala 


Trp 


Val 


Lys 


Asp 


Val 


Ser 


Gly 


lie 


Ser 


Tyr 


Ala 


Thr 


Ser 


He 


Thr 


Ala 


Trp 


Val 










410 










415 










420 





cag 


tta 


tat 


ttc 


etc 


tgg 


tat 


tat 


tct 


teg 


aaa 


aga 


etc 


cct 


atg 


tac 


Gin 


Leu 


Tyr 


Phe 


Leu 


Trp 


Tyr 


Tyr 


Ser 


Ser 


Lys 


Arg 


Leu 


Pro 


Met 


Tyr 


Gin 


Leu 


Tyr 


Phe 


Leu 


Trp 


Tyr 


Tyr 


Ser 


Ser 


Lys 


Arg 


Leu 


Pro 


Met 


Tyr 








425 










430 










435 







tct 


aag 


tta 


ctt 


tgg 


gag 


age 


ate 


egg 


cgt 


tec 


ata 


aaa 


gtt 


atg 


gga 


Ser 


Lys 


Leu 


Leu 


Trp 


Glu 


Ser 


He 


Arg 


Arg 


Ser 


He 


Lys 


Val 


Met 


Gly 


Ser 


Lys 


Leu 


Leu 


Trp 


Glu 


Ser 


He 


Arg 


Arg 


Ser 


He 


Lys 


Val 


Met 


Gly 






440 










445 










450 









acc 


act 


atg 


ctt 


get 


tgt 


atg 


att 


Thr 


Thr 


Met 


Leu 


Ala 


Cys 


Met 


He 


Thr 


Thr 


Met 


Leu 


Ala 


Cys 


Met 


He 




455 










460 





act 


eta 


ggc 


tta 


aat 


ate 


ctt 


acg 


Thr 


Leu 


Gly 


Leu 


Asn 


He 


Leu 


Thr 


Thr 


Leu 


Gly 


Leu 


Asn 


He 


Leu 


Thr 








465 











caa 


act 


aca 


tat 


gta 


att 


ttc 


tta 


aac 


ccc 


etc 


aca 


cca 


ctt 


get 


tgg 


Gin 


Thr 


Thr 


Tyr 


Val 


He 


Phe 


Leu 


Asn 


Pro 


Leu 


Thr 


Pro 


Leu 


Ala 


Trp 


Gin 


Thr 


Thr 


Tyr 


Val 


He 


Phe 


Leu 


Asn 


Pro 


Leu 


Thr 


Pro 


Leu 


Ala 


Trp 


470 










475 










480 










485 



ccc 


tta 


tec 


tec 


ata 


acg 


get 


caa 


gca 


att 


get 


ttt 


tta 


tct 


gag 


age 


Pro 


Leu 


Ser 


Ser 


He 


Thr 


Ala 


Gin 


Ala 


He 


Ala 


Phe 


Leu 


Ser 


Glu 


Ser 


Pro 


Leu 


Ser 


Ser 


He 


Thr 


Ala 


Gin 


Ala 


He 


Ala 


Phe 


Leu 


Ser 


Glu 


Ser 










490 










495 










500 




tgc 


att 


ttc 


ttg 


get 


ttt 


ttg 


ttt 


ggt 


ttt 


gca 


aaa 


ctg 


ctt 


cga 


gta 


Cys 


He 


Phe 


Leu 


Ala 


Phe 


Leu 


Phe 


Gly 


Phe 


Ala 


Lys 


Leu 


Leu 


Arg 


Val 


Cys 


He 


Phe 


Leu 


Ala 


Phe 


Leu 


Phe 


Gly 


Phe 


Ala 


Lys 


Leu 


Leu 


Arg 


Val 








505 










510 










515 







gaa 


gat 


ctt 


att 


Glu 


Asp 


Leu 


He 


Glu 


Asp 


Leu 


He 






520 




ggt 


ctt 


ttg 


caa 


Gly 


Leu 


Leu 


Gin 


Gly 


Leu 


Leu 


Gin 




535 







aat 


ttg 


get 


tct 


Asn 


Leu 


Ala 


Ser 


Asn 


Leu 


Ala 


Ser 








525 


aga 


caa 


cac 


gtg 


Arg 


Gin 


His 


Val 



540 



ttt gaa tac tgg 
Phe Glu Tyr Trp 
Phe Glu Tyr Trp 



atg caa gac act 
Met Gin Asp Thr 

545 



cgt 


ggg 


caa 


egg 


Arg 


Gly 


Gin 


Arg 


Arg 


Gly 


Gin 


Arg 


530 








caa 


aat 






Gin 


Asn 







taatcatgtt tgtttcttgt agctcagtcg ctttctttta gctttaagtt ttgatagect 1801 
gcttggtctt ctgtttctac acttaatatt gatactaagg atactatgaa aaaacaggta 1861 



tatcaatggt tagcgagtgt ggttctttta gegctgaca 1900 
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Figure 18 (RY-43) 

Restriction enzyme analysis of CPN 100557 

TaqI 

Alul Alul Muni Hpyl88IX Eco57I | 

CviJI CviJI Tsp509I Mnll | MboII Earl | | 

I i ill I ll I 

TAGCTTGAAATAGCTTCCTCCAATTGTGATTTCTGAAGAAGTATAGGGGGAAATGTCGAA 

x + + + + + + 60 

ATCGAACTTTATCGAAGGAGGTTAACACTAAAGACTTCTTCATATCCCCCTTTACAGCTT 



Msel 
BseRI | 

Taal| | Mnll 



Mnll 
Dral | 
Msel | | 
Mnll | | j 
MboII | | | I 
I III 

GAGATAGTCTTGTTTTAAAGGAGGAGGGGAAAACGGTTTAATGAGCAGAAAAGACAATGA 

61 + + + + + + 120 

CTCTATCAGAACAAAATTTCCTCCTCCCCTTTTGCCAAATTACTCGTCTTTTCTGTTACT 

Alul Hpyl7 8III 

CviJI Mspl | 

BpulOI | BsaWI | j 

Ddel |Tsp509I Msel BspEI j j Sfcl 

I I I I Ml I 

GGTTTCCTTAGCTCGTTCAATTTTTAATATATTATCCGGAACTTTCTGTAGTCGTATTAC 

12 i + + + + + + 180 

C CAAAGGAATCGAG CAAGTTAAAAATTATATAATAGGC CTTG AAAG AC ATC AGCATAATG 

Muni 
Tsp5091 
Dpnl 
Bbvl 
Sau3AI 

Tsp509I Alul 



Fnu4HI 
Alul | 
CviJI j 
Tsel j 
Cjel || 



Hpyl78III |CviRI Acelll CviJI | 

TaqI | | C j el | BsrDI Alwl | | 

I I I II I III 

AGGGATATTTCGAGAAATTGCAATGGCAACCTATTTTGGAGCTGATCCAATTGTAGCTGC 

181 + + + + + + 240 

TCCCTATAAAGCTCTTTAACGTTACCGTTGGATAAAACCTCGACTAGGTTAACATCGACG 

Hpyl78III 

Msel Mnll Banll Bf al | 

Taal Aflll| Apol | Bspl286I Fokl | 

Rsal | Smll|Tsp509I j CviJI |Xbal|| 

II II II I I III 

TTTCTGGTTAGGTTTCCGTACTGTTTTTTTCTTAAGAAAAATTTTAGGAGGGCTCATTCT 

241 + + + + + + 300 

AAAGACCAATCCAAAGGCATGACAAAAAAAGAATTCTTTTTAAAATCCTCCCGAGTAAGA 
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Fig. 18 (con't) 



Fnu4HI 
BsiEI | 
Pvul I 
Dpnl | 
BsmAI | 
Sau3AI | 

Mnll Hpyl78III | | 

Apol | BsiHKAI Mill Taul 

CviJI Tsp509l| Bspl286I TaqI | j j j Acil | 

i II I Mill II 

AGAACAAGCCTTCATCCCTCATTTTGAATTTCTCCGTGCTCAAAGTCTCGATCGTGCGGC 

301 + : + + + + + 360 

TCTTGTTCGGAAGTAGGGAGTAAAACTTAAAGAGGCACGAGTTTCAGAGCTAGCACGCCG 

Hpyl78III Fnu4HI 
Bfal| Tsel| 
Hgal | Mmel | j 

Hpyl88IX Xbal|| Msel | || Bbvl 

I Ml I I II I 

GTTTTTTTTCCGACGCTTTTCTAGATTGATTAAAGGCAGCACTATTATATTCACTCTGCT 

36 1 + + + + + + 420 

CAAAAAAAAGGCTGCGAAAAGATCTAACTAATTTCCGTCGTGATAATATAAGTGAGACGA 

Mnll 

Acll| BsmFI 

Earl j Hinf I 

RleAI MboII Maellj MboII Tfil 

II II II 

TATTGAAGCAGTATTGTGGGTATTCTTCAATAACGTTGAAGAGGGGACTTACGATATGAT 

421 + + + + + + 480 

ATAACTTCGTCATAACACCCATAAGAAGTTATTGCAACTTCTCCCCTGAATGCTATACTA 

Rsal 
BsrGI | 
Msel TatI j 
I I I 

TCTCCTTACTATGATACTCTTGCCCTGTGGCATTTTCTTAATGATGTACAATGTAAACGG 

481 + + + + + + 540 

AGAGGAATGATACTATGAGAACGGGACACCGTAAAAGAATTACTACATGTTACATTTGCC 

Sthl32I Sthl32I 

TspRI | BscGI | 

Haell Taal | j Alul | | 

Hhal| Beef I | j | CviJI | | 

II II I I III 

CGCTTTGCTTCACTGTGGAAATAAGTTTTTCGGGGTGGGATTAGCTCCCGTAGTTGTAAA 

541 + + + + + + 600 

GCGAAACGAAGTGACACCTTTATTCAAAAAGCCCCACCCTAATCGAGGGCATCAACATTT 
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Fig. 18 (con't) 



CviJI 
Fnu4HI | 
Taul | 
Acil| | 

II 



Bfal 
Dpnl 
BstYI 
Sau3AI 
Hpyl88IX| 
Alwl | | 
I II 



601 



TATCATTTGGATTTTCTTTGTTATAGCGGCTCGTCATTCAGATCCTAGAGAGCGTATTAT 

+ + + + + + 

ATAGTAAACCTAAAAGAAACAATATCGCCGAGCAGTAAGTCTAGGATCTCTCGCATAATA 



660 



Bfal 
Sthl32I 
CviJI | 
BsaJI | | 
BstDSI 



Cjel 



NspV 
TaqI 



ScrFI 
EcoRII | 
NlaIV| | 
Taal | | | 
I II I 



MboII TaqI |MseI 

I III 
CGGTTTATCCGTGGCTCTAGTTATCGGGTTTTTCTTCGAATGGTTAATCACGGTTCCTGG 

661 + + + + + + 720 

GC CAAATAGG CAC CGAG ATCAATAGCC CAAAAAGAAGC TTAC CAATT AGTG C CAAGG ACC 



Hpyl78III 
Bce83I MboII 
Earl | Banll 
Sapl | Bspl286I 
Apol Bpml | | Hin4I 

Tsp509I Cjel | || CviJI |Smll| 

I I I II II 

AGTATGGAAATTTCTATTAGAAGCGAAGAGCCCACCTCAAGAACACGATAGTGTTCGAGC 

721 + + + + + + 780 

TCATACCTTTAAAGATAATCTTCGCTTCTCGGGTGGAGTTCTTGTGCTATCACAAGCTCG 



BplI Alul 
MslI | CviJI 
Mnll | | TaqI | 



Tthlllll 

Alul 
CviJI 

Alul MspAlI 
CviJI MboII PvuII 

Mwol | Msel | Sf aNl | 

ii ii ii 

TTTATTAGCTCCCTTATCTTTGGGTATTTTAACTTCAAGCATCTTCCAGCTGAACCTTCT 

781 + + 4- + + + 840 

AAATAATCGAGGGAATAGAAACCCATAAAATTGAAGTTCGTAGAAGGTCGACTTGGAAGA 



CviJI 
Haelll 
EcoO109I | 
Nlalll | I 

Cac8I Rsal | | j Rsal 

Mwol | BsrGI | | || TatI | 

CviJI | | TatI | |Sau96l| Mnll | j 

III III II IM 

TTCTGATATCTGCTTGGCTCGCTATGTACATGAAATAGGCCCTCTATATCTTATGTACTC 

841 + + + + + + 900 

AAGACTATAGACGAACCGAGCGATACATGTACTTTATCCGGGAGATATAGAATACATGAG 



EcoRV 
Hpyl8 8IX | 
Tthlllll | | 
I I I 
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Fig. 18 (con't) 



Msel 



Alul 
CviJI 



Acelll 



CviJI 



BseRI Taal 



CTTAAAGATTTATCAGCTCCCCATACATCTCTTTGGCTTTGGTGTGTTTACCGTTCTCCT 



901 



-+ 960 



GAATTTCTAAATAGTCGAGGGGTATGTAGAGAAACCGAAACCACACAAATGGCAAGAGGA 

MboII 
Nlalll 
Hpyl7 8III 
Real 

Rsal Dpnl | 

Mnll BsrGI | Mnllj 

Tsp509I | TatI j Sau3AI | | 

II M Ml 

CCCAGCAATTTCTCGTTGTGTACAGCGAGAAGATCATGAGAGGGGATTGAAACTTATGAA 



961 



-+ 1020 



GGGTCGTTAAAGAGCAACACATGTCGCTCTTCTAGTACTCTCCCCTAACTTTGAATACTT 



HphI 



1021 



Dpnl 
Bell | 

Nlalll Sau3AI j CviJI Ddel 

III l> 
GTTCGTTCTCACCCTAACCATGTCCGTAATGATCATTATGACAGCAGGGCTATTGCTCTT 

+ + + + + + 1080 

CAAGCAAGAGTGGGATTGGTACAGGCATTACTAGTAATACTGTCGTCCCGATAACGAGAA 



BsaXI 
Hin4I 
Hinfl 
ScrFI 
EcoRII I 
Alul | | 

CviJI • | | 
I 



1081 



1141 



BseMII 
Mwol 

Hpyl88IX AccI | 

Plel Bpml Ddel | Mnll | | 

I I Mill 

AGCTTTACCTGGAGTCCGTGTCCTTTATGAACACGGACTTTTCCCTCAGAGTGCTGTCTA 

+ + + + + + 1140 

TCGAAATGGACCTCAGGCACAGGAAATACTTGTGCCTGAAAAGGGAGTCTCACGACAGAT 

BsaJI 
Styl 

BsrI CviJI | 

NlalV | Hael j NlalV 

Mnll Banl I Haelll | CviJI | 

i mi M ii 

CGCTATTGTTCGTGTATTGCGAGGTTATGGTGCCAGTATTATCCCTATGGCCTTGGCTCC 

+ + + + + + 1200 

GCGATAACAAGCACATAACGCTCCAATACCACGGTCATAATAGGGATACCGGAACCGAGG 
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Fig. 18 (con't) 

Fnu4HI 
Taul 
Acil | 

MspAllj BsrBI Hinfl 

BsmAI CviRI | | Acil | Tf il 

I III M I 

TTTAGTCTCTGTTCTTTTTTATGCACAGCGGCAGTATGCTGTTCCGCTCTTTATAGGAAT 

1201 + + + + + + 1260 

AAATCAGAGACAAGAAAAAATACGTGTCGCCGTCATACGACAAGGCGAGAAATATCCTTA 

CviJI Alul BsaHI 

Hael CviJI Maell 

Haelll Hindlll | Sthl32I | 

CviJI MscI Bcefl | | Bfal Dral | | 

Rsal |EaeI | Sspl | Msel | |DrdII | Msel | j | 

I I I I II III II II I I 

CGGTACGGCTTTGGCCAATATTGTTTTAAGCTTGGTTCTAGGTCGTTGGGTTTTAAAAGA 

1261 + + + + + + 1320 

GCCATGCCGAAACCGGTTATAACAAAATTCGAACCAAGATCCAGCAACCCAAAATTTTCT 

Fokl 
BsmAI 
BsmBI 

Aval " | I Taqll 
Aatlll | I Tthlllll | CviRI MboII 

I I I i 

CGTCTCGGGCATTTCCTATGCTACATCCATAACTGCTTGGGTGCAGTTATATTTCCTCTG 

1321 + + + + + + 1380 

GCAGAGCCCGTAAAGGATACGATGTAGGTATTGACGAACCCACGTCAATATAAAGGAGAC 

Plel Fokl 
Mnll NspV| Rsal Maelll | SfaNI 

Bsgl | Taqlj Hinfl TatI |DdeI | j Mspl | 

I I II I Mill II 

GTATTATTCTTCGAAAAGACTCCCTATGTACTCTAAGTTACTTTGGGAGAGCATCCGGCG 

1381 + 4- + + + + 1440 

CATAATAAGAAGCTTTTCTGAGGGATACATGAGATTCAATGAAACCCTCTCGTAGGCCGC 



Tthlllll 
Drdll | 
NlaIV| 
Tthlllll | | 
Xcml | j | 

I III 

TTCCATAAAAGTTATGGGAACCACTATGCTTGCTTGTATGATTACTCTAGGCTTAAATAT 

1441 + + + + + + 1500 

AAGGTATTTTCAATACCCTTGGTGATACGAACGAACATACTAATGAGATCCGAATTTATA 



Msel 
CviJI | 

Cac8I Bfal | | 
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Fig. 18 (con't) 



Bce83I 
CjePI 
CviJI 
Haelll 
Sau96I | 

Tsp509I BstXI | | 

Ndel I Msel Tthlllll Mnll | | | 

II I I I I M 

CCTTACGCAAACTACATATGTAATTTTCTTAAACCCCCTCACACCACTTGCTTGGCCCTT 

1501 + + + + + + 1560 

GGAATGCGTTTGATGTATACATTAAAAGAATTTGGGGGAGTGTGGTGAACGAACCGGGAA 



Muni 
Tsp509I 
Smll | Acelll| 

CviJI | | Beef I | I 

Mnll | | |BseMII j j 

III I Ml 



Ddel 
Hpyl8 8IX 
Tthlllll 

CjePI 
Bbvl 



CviRI 
Fnu4HI 
Alul | 
CviJI j 
Tsel j 
I I 



CviJI 
Tthlllll | 
I I 



1561 



ATCCTCCATAACGGCTCAAGCAATTGCTTTTTTATCTGAGAGCTGCATTTTCTTGGCTTT 

+ + + + + + 1620 

TAGGAGGTATTGCCGAGTTCGTTAACGAAAAAATAGACTCTCGACGTAAAAGAACCGAAA 



Dpnl MboII 
Bglll |Tsp509l| 
BstYI j Msel | | 
CviRI TaqI Sau3AI j Vspl j | CviJI 

I I I I III I 

TTTGTTTGGTTTTGCAAAACTGCTTCGAGTAGAAGATCTTATTAATTTGGCTTCTTTTGA 



1621 



■+ 1680 



AAACAAACCAAAACGTTTTGACGAAGCTCATCTTCTAGAATAATTAAACCGAAGAAAACT 



BsrI 
Sthl32I 



SimI 
BSCGI [ 



BsaAI 
Pmll 
Mae 1 1 
Afllll | 
SfaNI I j 
CviRI |BsbI | 



I I I 



CviRI 



Tsp509I 
Tthlllll 



ATACTGGCGTGGGCAACGGGGTCTTTTGCAAAGACAACACGTGATGCAAGACACTCAAAA 



1681 



-+ 1740 



TATGACCGCACCCGTTGCCCCAGAAAACGTTTCTGTTGTGCACTACGTTCTGTGAGTTTT 



Tthlllll 

Msel | Bbsl 

Ddel Alul | | Cac8I 

Msel Alul | CviJI j I MboII 

Vspl Nlalll CviJI | BseMIT | j j CviJI | 

I I II | | | I I I 

TTAATCATGTTTGTTTCTTGTAGCTCAGTCGCTTTCTTTTAGCTTTAAGTTTTGATAGCC 

1741 + + + + + + 1800 

AATTAGTACAAACAAAGAACATCGAGTCAGCGAAAGAAAATCGAAATTCAAAACTATCGG 
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Fig. 18 (con't) 

Sspl. Ddel 
Msel | BciVI | 
II II 

TGCTTGGTCTTCTGTTTCTACACTTAATATTGATACTAAGGATACTATGAAAAAACAGGT 

180 i + + + + + + I860 

ACGAACCAGAAGACAAAGATGTGAATTATAACTATGATTCCTATGATACTTTTTTGTCCA 

Haell 
Hhal | 

Drdll Eco47III | j 

I Ml 
ATATCAATGGTTAGCGAGTGTGGTTCTTTTAGCGCTGACA 

1861 + + + + 1900 

TATAGTTACCAATCGCTCACACCAAGAAAATCGCGACTGT 
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Figure 19: CPN1 00622 

tctcaagagt aaccttatcc ttagattatt cagctcaagt ctcctcgtca actgtaggtc 60 

aataccttaa agctgagagt cattgcacat tttaaccaca atg aaa aca tea agg 115 

Met Lys Thr Ser Arg 
1 5 

aat aaa cag tgc aaa ata aca gat ccc tta agt aaa tct tec ttc ttt 163 
Asn Lys Gin Cys Lys lie Thr Asp Pro Leu Ser Lys Ser Ser Phe Phe 
10 . 15 20 

gtt gga gec tta att tta ggt aaa act aca ata etc ctt aat gcg act 211 
Val Gly Ala Leu lie Leu Gly Lys Thr Thr lie Leu Leu Asn Ala Thr 
25 30 35 

ccg ttg tct gac tat ttt gat aat caa gca aat caa etc aca aca etc 259 
Pro Leu Ser Asp Tyr Phe Asp Asn Gin Ala Asn Gin Leu Thr Thr Leu 
40 45 50 

ttc cct eta att gat act ctt act aac atg act ccc tac tct cat aga 307 
Phe Pro Leu" lie Asp Thr Leu Thr Asn Met Thr Pro Tyr Ser His Arg 
55 60 65 



gca aca ctt ttt gga gtt agg gat gac act aac caa gac att gtc etc 
Ala Thr Leu Phe Gly Val Arg Asp Asp Thr Asn Gin Asd He Val Leu 
70 75 80 " 85 

gat cac cag aat tec ata gaa age tag ttc gaa aac ttc tct caa gac 
Asp His Gin Asn Ser He Glu Ser Trp Phe Glu Asn Phe Ser Gin Asp 
90 95 100 

ggc ggt get etc tct tgc aaa tea ctt gee ata acg aat aca aaa aac 
Gly Gly Ala Leu Ser Cys Lys Ser Leu Ala He Thr Asn Thr Lys Asn 
105 no us 

caa att ctt ttc eta aat age ttt get att aaa aga get ggt gcg atg 
Gin He Leu Phe Leu Asn Ser Phe Ala He Lys Arg Ala Gly Ala Met 
120 125 130 

tat gtt gat ggt aat ttc gat ctt tct gag aat cat ggt tec ate att 
Tyr Val Asp Gly Asn Phe Asp Leu Ser Glu Asn His Gly Ser He He 
135 140 145 

ttc tct ggg aat tta age ttt cct aat gca agt aat ttc get gat act 
Phe Ser Gly Asn Leu Ser Phe Pro Asn Ala Ser Asn Phe Ala Asp Thr 
I 50 155 iso 165 



355 



403 



451 



499 



547 



595 



tgt aca ggg gga get 
Cys Thr Gly Gly Aia 
Thr Gly Gly Ala 
170 

aat caa gga acc gca 
Asn Gin Gly Thr Ala 
Asn Gin Gly Thr Aia 
185 



gtz tta tgt teg aaa aat 
Val Leu Cys Ser Lys Asn 
Vai Leu Cvs Ser Lvs Asn 
175 

tac ttc att aac aac aag 
Tyr Phe lie Asn Asn Lvs 
Tyr Phe He Asn Asn Lys 
190 



gtt aca ate tea aaa 643 
Val Thr He Ser Lys 
Val Thr lie Ser Lvs 
180 

gca aaa tct tea gga 691 
Aia Lys Ser Ser Giy 
Ala Lys Ser Ser Gly 
195 
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Fig. 19 (con't) 

gga gca ate caa get gca ate ata aac att aag gac aac act ggc cct 739 

Gly Ala lie Gin Ala Ala lie lie Asn He Lys Asp Asn Thr Gly Pro 

Gly Ala He Gin Ala Ala He lie Ajh Ha Lys Asp Asn Thr Gly Pro 
200 205 210 

tgc ctg ttt ttt aat aat get gca ggc gga aca gcg ggg ggc gcg ttg 787 

Cys Leu Phe Phe Asn Asn Ala Ala Gly Gly Thr Ala Gly Gly Ala Leu 
Cys Leu Phe Phe Asn Asn Ala Ala Gly Gly Thr Ala Gly Gly Ala Leu 
215 220 225 

ttc get aat get tgt aga att gag aat aat tct cag cct ate tat ttt 835 

Phe Ala Asn Ala Cys Axg He Glu Asn Asn Ser Gin Pro He Tyr Phe 

Phe Ala Asn Ala Cys Arg lie Glu Asn Asn Ser Gin Pro He Tyr Phe 

230 235 240 245 

ttg aat aac caa tea ggt ctg ggt ggt gca ata aga gta cat caa gag 88 3 

Leu Asn Asn Gin Ser Gly Leu Gly Gly Ala He Arg Val His Gin Glu 

Leu Asn Asn Gin Ser Gly Leu Gly Gly Ala He Arg Val His Gin Glu 

250 255 260 

tgc att ctt aca aag aat acc ggt tct gug ate ttc aac aat aat ttt 931 

Cys He Leu Thr Lys Asn Thr Gly Ser Val He Phe Asn Asn Asn Phe 

Cys He Leu Thr Lys Asn Thr Gly Ser Val He Phe Asn Asn Asn Phe 
265 270 275 

gec atg gaa gcg gac ate tct get aac cat tec tct gga ggg get ate 979 

Ala Met Glu Ala Asp He Ser Ala Asn His Ser Ser Gly Gly Ala He 

Ala Met Glu Ala Asp He Ser Ala Asn His Ser Ser Gly Gly Ala He 
280 285 290 

tat tgc att agt tgt: tct ata aaa gac aac cca gga att gca gee ttc 1027 

Tyr Cys He Ser Cys Ser He Lys Asp Asn Pro Gly He Ala Ala Phe 

Tyr Cys He Ser Cys Ser He Lys Asp Asn Pro Glv He Ala Ala Phe 
295 300 305 

gat aat aat act gca gca cga gat gga ggt get ate tgt aca caa tct 1075 

Asp Asn Asn Thr Ala Ala Arg Asp Gly Gly Ala He Cys Thr Gin Ser 

Asp Asn Asn Thr Ala Ala Arg Asd Gly Gly Ala He Cys Thr Gin Ser 

310 315 * 320 325 

eta act ata caa gac agt ggt ccc gtc tat ttc aca aac aat cag gga 1123 

Leu Thr He Gin Asp Ser Gly Pro Val Tyr Phe Thr Asn Asn Gin Gly 

Leu Thr He Gin Asd Ser Gly Pro Val Tyr Phe Thr Asn Asn Gin Gly 
330 335 340 

act tgg ggc ggc get ate atg etc cgt caa gat ggt gca tgc act tta 1171 

Thr Trp Gly Gly Ala lie Met Leu Arg Gin Asp Gly Ala Cys Thr Leu 

Thr TrD Gly Gly Ala lie Met Leu Arg Gin Asp Gly Ala Cys Thr Leu 

345 350 355 

ttt get gat cag gga gat att att ttt tat aat aat aga cac ttc aaa 1219 

Phe Ala Asp Gin Gly Asp lie lie Phe Tyr Asn Asn Arg His Phe Lys 

Phe Ala Asp Gin Gly Asp lie lie Phe Tyr Asn Asn Arg His Phe Lvs 

360 365 370 

gat act ttc age aat cat gtt tct gta aac tgc acg cgt aat gtc tea 1267 

Asp Thr Phe Ser Asn His Val Ser Val Asn Cys Thr Arg Asn Val Ser 

Asd Thr Phe Ser Asn His Val Ser Val Asn Cys Thr Arg Asn Val Ser 
375 380 385 
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tta aca gtt gga gca agt caa ggt cat tct get acc ttc tat gat ccc 1315 

Leu Thr Val Gly Ala Ser Gin Gly His Ser Ala Thr Phe Tyr Asp P-o 

Leu Thr Val Gly Ala Ser Gin Gly His Ser Ala Thr Phe Tyr Aso P-o 



390 395 400 



405 



1363 



ata eta caa aga tat act ata caa aac tct ate caa aaa ttt aat cct 

lie Leu Gin Arg Tyr Thr He Gin Asn Ser lie Gin Lys Phe Asn P-o 

He Leu Gin Arg Tyr Thr He Gin Asn Ser lie Gin Lys Phe Asn P-o 
410 415 420 

aat cca gaa cac etc gga act ate ttg ttc tec tea aca tat att cog 1411 

Asn Pro Glu His Leu Gly Thr lie Leu Phe Ser Ser Thr Tyr He Pro 

Asn Pro Glu His Leu Gly Thr He Leu Phe Ser Ser Thr Tyr He Pro 
425 430 435 

gat aca teg act tct cgt gat gac ttc att tea cat ttc aga aac cac 1459 

Asp Thr Ser Thr Ser Arg Asp Asp Phe He Ser His Phe Arg Asn His 

Asp Thr Ser Thr Ser Arg Asp Asp Phe He Ser His Phe Arg Asn His 

440 445 450 

att gga ctg tac aac ggc aca ere get ctt gaa gat cga gca gag tgg 1507 

He Gly Leu Tyr Asn Gly Thr Leu Ala Leu Glu Asp Arg Ala Glu Trp 

He Gly Leu Tyr Asn Gly Thr Leu Ala Leu Glu Asp Arg Ala Glu Trp 
455 460 465 

aaa gtc tat aaa ttt gat caa ttt ggr ggg act eta egg tta ggc agt 1555 

Lys Val Tyr Lys Phe Asp Gin Phe Gly Gly Thr Leu Arg Leu Gly Ser 

Lys Val Tyr Lys Phe Asp Gin Phe Gly Gly Thr Leu Arg Leu Gly Ser 
470 475 480 485 

aga get gtg ttt tct aca aca gac gaa gaa caa agt age agt agt gtg 1603 

Arg Ala Val Phe Ser Thr Thr Asp Glu Glu Gin Ser Ser Ser Ser Val 

Arg Ala Val Phe Ser Thr Thr Asp Glu Glu Gin Ser Ser Ser Ser Val 

490 495 500 

ggt tct gta att aac ate aat aat ctt gca att aac ctt ccc tct ate 1651 

Gly Ser Val He Asn He Asn Asn Leu Ala lie Asn Leu Pro Ser lie 

Gly Ser Val He Asn He Asn Asn Leu Ala He Asn Leu Pro Ser He 

505 510 515 

tta ggc aac aga gtt get ccc aag eta tgg att cgc ccc aca ggt tea 1699 

Leu Gly Asn Arg Val Ala Pro Lys Leu Trp lie Arg Pro Thr Gly Ser 

Leu Gly Asn Arg Val Ala Pro Lys Leu Trp He Arg Pro Thr Gly Ser 
520 525 530 

tea gca ccc tat age gaa gat aat aac cct ata ate aat etc tea gga 1747 

Ser Ala Pro Tyr Ser Glu Asp Asn Asn Pro He He Asn Leu Ser Gly 

Ser Ala Pro Tyr Ser Glu Asp Asn Asn Pro He He Asn Leu Ser Gly 
535 540 545 

cct ttg age eta ctg gat gac gag aac eta gat ccc tat gat act gca 1795 

Pro Leu Ser Leu Leu Asp Asp Glu Asn Leu Asp Pro Tyr Asp Thr Ala 

Pro Leu Ser Leu Leu Asp Asp Glu Asn Leu Asp Pro Tyr Asp Thr Ala 
550 555 560 " 565 

gac ctt gec caa cct ate gca gaa gtt cct ctt ctg tat etc tta gac 1843 

Asp Leu Ala Gin Pro lie Ala Glu Val Pro Leu Leu Tyr Leu Leu Asp 

Asp Leu Ala Gin Pro lie Ala Glu Val Pro Leu Leu Tyr Leu Leu Aso 

570 575 580 
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Figure 20 (RY-44) 

Restriction enzyme analysis of CPN 100622 
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Hpyl78III 
Rsal | BsmI 

Taqll CviRI TatI | | CviRI 

I I I I I I 

TAACCAATCAGGTCTGGGTGGTGCAATAAGAGTACATCAAGAGTGCATTCTTACAAAGAA 

841 + + + + + + 900 

ATTGGTTAGTCCAGACCCACCACGTTATTCTCATGTAGTTCTCACGTAAGAATGTTTCTT 
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Fig. 20 (con't) 



Dpnl 
Sau3AI 
MboII 
Mspl 
BsaWI | 
BsrFI | 
PinAI j 



Tsp509I 
I 



Acil 
Mwol 
Nlalll 
BsaJI 
BstDSI 
Ncol 
Styl 



TACCGGTTCTGTGATCTTCAACAATAATTTTGCCATGGAAGCGGACATCTCTGCTAACCA 

901 + + + + + + 

ATGGCCAAGACACTAGAAGTTGTTATTAAAACGGTACCTTCGCCTGTAGAGACGATTGGT 



960 



CviJI 
Hin4I 
BslI | 
Hpyl78III | | 
BstXI | | | 
Mnll | | |MnlI 
I 



961 



Tsp509I 
ScrFI |Fnu4HI 
Bpml BsaJI | | CviRI | 

CviRI | EcoRII j j Tselj 

II I I I II 

TTCCTCTGGAGGGGCTATCTATTGCATTAGTTGTTCTATAAAAGACAACCCAGGAATTGC 

+ + + + + + 

AAGGAGACCTCCCCGATAGATAACGTAATCAACAAGATATTTTCTGTTGGGTCCTTAACG 



1020 



TaqI 
CviJI | Bbvl 
I 



Mill I 
BssSI 
PstI 
Fnu4HI 
CviRI | 
Tsel j 
Sfcl | | 
I II 



Rsal 
BsrGI | 
TatI 



Bbvl 

Bccl|Hin4I TatI j BsmFI 

II I I I I 

AGCCTTCGATAATAATACTGCAGCACGAGATGGAGGTGCTATCTGTACACAATCTCTAAC 

1021 + + + + + + 1080 

TCGGAAGCTATTATTATGACGTCGTGCTCTACCTCCACGATAGACATGTGTTAGAGATTG 



Sthl32I 
BscGI 
Nlaiv 



TspRI 
Avail 
Sau96I 
PshAI | 
Taal | j 



Haell 
Fnu4HI | 
Taul | 
Tthlllll AciljHhalj 
I II II 

TATACAAGACAGTGGTCCCGTCTATTTCACAAACAATCAGGGAACTTGGGGCGGCGCTAT 

1081 + + + + + + l* 40 

ATATGTTCTGTCACCAGGGCAGATAAAGTGTTTGTTAGTCCCTTGAACCCCGCCGCGATA 
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Fig. 20 (con't) 



CviRI 
Nlalll 
Nspl 
SphI 

Cac8I | Dpnl 
Hpyl78III CviRI | | Bell | 

Nlalll |BccI | I I Sau3AI j 

I I I I I I M 

CATGCTCCGTCAAGATGGTGCATGCACTTTATTTGCTGATCAGGGAGATATTATTTTTTA 



1141 



1200 



GTACGAGGCAGTTCTACCACGTACGTGAAATAAACGACTAGTCCCTCTATAATAAAAAAT 



Nlalll 
Bsgl | 



Mmel 
Thai 
Afllll 
Cac8I 
Mlul 
CviRI I 



TAATAATAGACACTTCAAAGATACTTTCAGCAATCATGTTTCTGTAAACTGCACGCGTAA 



1201 



-+ 1260 



ATTATTATCTGTGAAGTTTCTATGAAAGTCGTTAGTACAAAGACATTTGACGTGCGCATT 



Dpnl 

Msel Sau3AI | 

BsmAI I Taal Alwl | j 

M l III 

TGTCTCATTAACAGTTGGAGCAAGTCAAGGTCATTCTGCTACCTTCTATGATCCCATACT 



1261 



■+ 1320 



ACAGAGTAATTGTCAACCTCGTTCAGTTCCAGTAAGACGATGGAAGATACTAGGGTATGA 



CjePI 
Msel | 

Apol | | Hpyl88IX 
Tsp509I | |Hpyl78III BsaJI | 

III I II 

ACAAAGATATACTATACAAAACTCTATCCAAAAATTTAATCCTAATCCAGAACACCTCGG 

1321 + + + + + + 1380 

TGTTTCTATATGATATGTTTTGAGATAGGTTTTTAAATTAGGATTAGGTCTTGTGGAGCC 



Mnll 
BseRI | 

II 



CjePI 
I 



Hpyl78III 
Mspl 
BsaWI | 
BspEI | 
BciVI | j 
Mnll | | 
I 



Hpyl78III 
TaqI BssSI | 

i i i 



AACTATCTTGTTCTCCTCAACATATATTCCGGATACATCGACTTCTCGTGATGACTTCAT 

1381 + + + + + + 1440 

TTGATAGAACAAGAGGAGTTGTATATAAGGCCTATGTAGCTGAAGAGCACTACTGAAGTA 
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Fig. 20 (con't) 

TaqI 

Rsal Dpnl | 

BsrGI | Sau3AI | 

Taal | Bcefl | j 

Hpyl88IX TatI | Hpyl78IIl| j j 

I II II I I 

TTCACATTTCAGAAACCACATTGGACTGTACAACGGCACACTCGCTCTTGAAGATCGAGC 

1441 + + + + + + 1500 

AAGTGTAAAGTCTTTGGTGTAACCTGACATGTTGCCGTGTGAGCGAGAACTTCTAGCTCG 

Tsp509I 
Dpnl 
Bell 
Sau3AI 
Apol | 
MboII Tsp509I | 

I 



BsmFI 

Acelll | Alul 



Plel Hinfl Taal | CviJI 

II II I 

AGAGTGGAAAGTCTATAAATTTGATCAATTTGGTGGGACTCTACGGTTAGGCAGTAGAGC 

1501 + + + + + + 1560 

TCTCACCTTTCAGATATTTAAACTAGTTAAACCACCCTGAGATGCCAATCCGTCATCTCG 

MboII Msel 
RleAI | Tsp509I | 

II II 
TGTGTTTTCTACAACAGACGAAGAACAAAGTAGCAGTAGTGTGGGTTCTGTAATTAACAT 

1561 + + + + + + 1620 

ACACAAAAGATGTTGTCTGCTTCTTGTTTCATCGTCATCACACCCAAGACATTAATTGTA 

BslI 

Msel PflMI 
Tsp509I | Mnll Alul | 

CviRI | | Ddel | CviJI j 

III II M 

CAATAATCTTGCAATTAACCTTCCCTCTATCTTAGGCAACAGAGTTGCTCCCAAGCTATG 

162 1 + + + + + + 1680 

GTTATTAGAACGTTAATTGGAAGGGAGATAGAATCCGTTGTCTCAACGAGGGTTCGATAC 

Hinfl Sfcl 
Tfil RleAI | MboII 

I ll l 

GATTCGCCCCACAGGTTCATCAGCACCCTATAGCGAAGATAATAACCCTATAATCAATCT 

1681 + + + + + "-- + 1740 

CTAAGCGGGGTGTCCAAGTAGTCGTGGGATATCGCTTCTATTATTGGGATATTAGTTAGA 
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Fig. 20 (con't) 



Avail 
EcoO109I 
Psp5II 
Sau96I 
Sse8647I 
Hpyl78III | 
Ddel i I 
I I 



Dpnl 
BstYI 
Sau3AI 
Bfal | 

BseMII Fokl j 

CviJI BsrI Alwl | j 

I I IM 

CTCAGGACCTTTGAGCCTACTGGATGACGAGAACCTAGATCCCTATGATACTGCAGACCT 

1741 + + + + + + 

GAGTCCTGGAAACTCGGATGACCTACTGCTCTTGGATCTAGGGATACTATGACGTCTGGA 



BstAPI 
PstI | 
CviRI | | 
Sfcl I Mwol 



1800 



Aatll 
Maelll 
Tsp45I 
BsaHI | 

Mnll Mae I I j | Alul Msel 
MboII Earl | Ddel | j j CviJI Vspl 

I I I I III I I 

TGCCCAACCTATCGCAGAAGTTCCTCTTCTGTATCTCTTAGACGTCACAGCTAAACATAT 

1801 + + + + + + I860 

ACGGGTTGGATAGCGTCTTCAAGGAGAAGACATAGAGAATCTGCAGTGTCGATTTGTATA 



BseMII Bsu36I 
Tsp509I | Mnll Ddel SimI 



Mwol 
BsaJI 
Styl 
BsmFI | 
CviJI I 



Bsbl 

II i 

TAATACGGATAATTTCTACCCTGAGGGTCTAAATACAACTCAACACTACGGCTACCAAGG 

186 1 + + + + -_-- + + 1920 

ATTATGCCTATTAAAGATGGGACTCCCAGATTTATGTTGAGTTGTGATGCCGATGGTTCC 



NlalV 
Avail | 
Sau96I j 
Bcefl | | 
I 



Taql 
BsrI 
Dpnl 
Sau3AI | 
BslI | | 

I i 



Alwl 



Hpyl88IX 
MboII 



Mnll 
Earl | 
Hpyl88IX| | 



1921 



CGTTTGGTCCCCTTACTGGATCGAAACAATCACAACTTCTGATACCTCTTCTGAAGATAC 

+ + + + + + 

GCAAACCAGGGGAATGACCTAGCTTTGTTAGTGTTGAAGACTATGGAGAAGACTTCTATG 



1980 



MboII 
Taal I Eco57I 



BstXI 
Alul | 
CviJI | Sfcl 
Cac8I | j Hphl| 

i i i mi ii 

TGTGAATACTTTACATCGCCAGCTTTATGGTGATTGGACACCTACAGGATATAAGGTAAA 

1981 + + + + + + 2040 

ACACTTATGAAATGTAGCGGTCGAAATACCACTAACCTGTGGATGTCCTATATTCCATTT 
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Fig. 20 (con't) 

BsmAI BsrDI Mwol 

I I I 

CCCAGAAAACAAAGGAGACATTGCCCTATCTGCCTTCTGGCAATCTTTCCATAACTTATT 

2041 + + + + + + 2100 

GGGTCTTTTGTTTCCTCTGTAACGGGATAGACGGAAGACCGTTAGAAAGGTATTGAATAA 

Hpyl78III 
Cjel AlwNI 



Tthlllll 
CviJI | 
Hael | 

Maell Haelll j 



Alul 
CviJI 
Mwol | 
Sfcl I 



Plel 
Alul | 
CviJI | 
I I 

TGCGACACTACGTTATCAAACACAGCAAGGCCAAATAGCACCTACAGCTTCTGGAGAAGC 

2ioi + + + + + + 2160 

ACGCTGTGATGCAATAGTTTGTGTCGTTCCGGTTTATCGTGGATGTCGAAGACCTCTTCG 

Cjel 

Hinfl | CviRI Hinfl 

Taql | j Earl | Tfil Sthl32I Alul 

MboII I Bpml | | SfaNI CjePI XmnI | Ndel | CviJI 

I ill II! I I M I t I 

TACTCGACTCTTCGTGCATCAAAATAGCAACAATGATGCGAAAGGATTCCATATGGAAGC 

2161 + + + + + + 2220 

ATGAGCTGAGAAGCACGTAGTTTTATCGTTGTTACTACGCTTTCCTAAGGTATACCTTCG 

Tthlllll 
TspRI | 

CjePI Mnll | | Alul 

BscGI | Btsl| j j CviJI 

II II I I I 

TACGGGTTATTCTTTGGGAACAACCTCAAACACTGCTTCTAATCATAGCTTTGGTGTAAA 

2221 + + + + + + 2280 

ATGCCCAATAAGAAACCCTTGTTGGAGTTTGTGACGAAGATTAGTATCGAAACCACATTT 

BsaJI 
BstDSI 
Tsp509I | 
Hpyl8 8IX | | 
CviJI | | j CviJI 

Pf 111081 Hin4I j | |Bpll| 

I ! I I ! II 

CTTCTCCCAACTTTTCAGTAATCTCTACGAGAGCCACTCCGACAATTCCGTGGCTTCGCA 

2281 + + + + + + 2340 

GAAGAGGGTTGAAAAGTCATTAGAGATGCTCTCGGTGAGGCTGTTAAGGCACCGAAGCGT 
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Fig. 20 (con't) 



2341 



HaelV 
Hin4I 
Bbvl 
Dpnl 
Sau3AI | 
Hpyl78IIl| | 
Haell | | | | | CviRI 
Hhal | Mil! Fnu4HI | 

Eco47IIl|| || || j CviJl| j Hin4I 

Taal Ml | I I I I BsaJI | j | Hinf I 

Sfcl| III || || | Styl Tsel | j Tfil 

II III II III I Ml I 

TACGACAACTGTAGCGCTCCAGATCAATAATCCTTGGCTGCAAGAGAGATTCTCTACATC 

+ + + + + + 2400 

ATGCTGTTGACATCGCGAGGTCTAGTTATTAGGAACCGACGTTCTCTCTAAGAGATGTAG 



Mmel 
Bpml | 
I 



Sf cl 
CviJI 
Sf aNI 
Bfal | 
Mwol | 
CviRI Bpll| | 
I 



Hpyl78III 
SfaNI | 
Hpyl78III | j 



Sfcl 
Alul | 
| CviJI j 

II III 
TGCATCTCTAGCCTACAGCTACAGCAACCACCATATCAAAGCATCTGGATATTCTGGAAA 

2401 + + + + + + 2460 

ACGTAGAGATCGGATGTCGATGTCGTTGGTGGTATAGTTTCGTAGACCTATAAGACCTTT 



CviJI 
Fnu4HI | 
Taul | 

Rsal Acil| j 

I M I 

AATACAAACGGAAGGCAAATGTTATAGTACGACATTAGGGGCGGCTCTCTCTTGCTCTCT 

2461 + + + + + + 2520 

TTATGTTTGCCTTCCGTTTACAATATCATGCTGTAATCCCCGCCGAGAGAGAACGAGAGA 



BsaXI 
Hpyl78III | 

Dpnl | | Muni 
Sau3AI | | | Mnll Bcefl Tsp509I 

I I I I I I I 

ATCTCTACAATGGCGATCACGACCTCTCCACTTCACTCCTTTTATCCAAGCAATTGCCGT 

2521 + + + + + + 2580 

TAGAGATGTTACCGCTAGTGCTGGAGAGGTGAAGTGAGGAAAATAGGTTCGTTAACGGCA 
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Fig. 20 (con't) 



XmnI 
Apol 
Tsp509I 
Bf al | 
Alul| I 

Tthlllll Hpyl78III CviJI j j 

I I Ml 

TCGTTCTAATCAAACTGCGTTTCAAGAAAGTGGAGATAAAGCTAGAAAATTTTCTGTTCA 

2581 + + + + + + 2640 

AGCAAGATTAGTTTGACGCAAAGTTCTTTCACCTCTATTTCGATCTTTTAAAAGACAAGT 



Hinf I 
Bsp24I 
CjePI 
Cjel 



Haell 
Hhal 
EC047III 
Hpyl8 8IX 
Mnll 



BsmFI 



Hin4I 
Taal I 



Apol j 
EcoRI | 
Tsp509I | 
I I 

TAAACCCTTATATAACCTGACAGTCCCTCTGGGAATTCAGAGCGCTTGGGAATCCAAGTT 

2641 + + + + + + 2700 

ATTTGGGAATATATTGGACTGTCAGGGAGACCCTTAAGTCTCGCGAACCCTTAGGTTCAA 



I 



Bbsl 
MboII 
Tfil Aloll 



Cjel Cac8I 
CjePI Alul | Mnll 

Bsp24l| CviJI j CviJI BseMII | 

II II I M 

CCGTCTTCCTACCTATTGGAACATAGAGCTTGCTTATCAGCCTGTCCTCTACCAACAAAA 



2701 



■+ 2760 



GGCAGAAGGATGGATAACCTTGTATCTCGAACGAATAGTCGGACAGGAGATGGTTGTTTT 
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Fig. 20 (con't) 



Dpnl 
Sau3AI 
Bf al 



Plel 
Hinf I 
Tfil 
Hpyl78III 
Bfal 
Xbal 
Hinf I 
Maell 



Dpnl 
Sau3AI | 
Ddel | j 
Hpyl78III | | 



Drdll 
NlalV 
Hpyl78III 
NlalV 
CviJI | 
Nlalll | j 
Alwl | j j CjePI 

I III I 



TCCTGAGATCAACGTGAGTCTAGAATCTAGTGGATCGTCATGGCTCCTATCAGGAACCAC 



2761 



AGGACTCTAGTTGCACTCAGATCTTAGATCACCTAGCAGTACCGAGGATAGTCCTTGGTG 



-+ 2820 



BsrDI 
BsrDI | Dral 
Mwol | j Msel | 
CacBI | | | CjePI | | 
I 



MboII 
MboII | 
Apol | | 
Tsp509l| | 
II I 



CCTTGCTCGCAATGCCATTGCTTTTAAAGGAAGAAACCAAATTTTTATCTTCCCTAAACT 

2821 + + + + + + 2880 

GGAACGAGCGTTACGGTAACGAAAATTTCCTTCTTTGGTTTAAAAATAGAAGGGATTTGA 

Mnll 

Ddel CviJI BciVI | 

I I ii 

TTCGGTGTTCTTAGACTATCAAGGCTCGGTATCCTCATCAACGACGACACATTACCTTCA 

2881 + + + + + + 2940 

AAGCCACAAGAATCTGATAGTTCCGAGCCATAGGAGTAGTTGCTGCTGTGTAATGGAAGT 



Dral 
Msel | 



Nlalll 
Nspl 



Msel Msel | Nspl CviRI 

I 

CGCAGGAACGACCTTTAAGTTTTAAAAGCATGTTATATAGACAATGCAACCTGTAAAGAC 

2941 + + + + + + 3000 

GCGTCCTTGCTGGAAATTCAAAATTTTCGTACAATATATCTGTTACGTTGGACATTTCTG 



CjePI 



Hinf I 
Nlalll 
Tfil 
Hpyl78III | 
Real | | 
Bed | | | 
I I I 



CjePI 
BpulOI 
Ddel 
Alul | 
CviJI j 
I I 



CAAATAGAGAGTAGTGAACACTCTCTACCATCATGAATCTTATGGGAGAAGCTAAGGGAA 

3001 + + + + + + 3060 

GTTTATCTCTCATCACTTGTGAGAGATGGTAGTACTTAGAATACCCTCTTCGATTCCCTT 
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Fig. 20 (con't) 

NspV 
TaqI 

Fokl Hinfl | 

Msel Hin4I Mnll | 

Maell Tsp509I | Sthl32I Bfal Tfil | 

I II I I I I 

ATCCACAGATACGTTTCCCCCATAAAAATTAAGAACCCGATACATCCTCACTAGAGATTC 

3061 + + + + + + 3120 

TAGGTGTCTATGCAAAGGGGGTATTTTTAATTCTTGGGCTATGTAGGAGTGATCTCTAAG 

BsmI 
BpulOI | 
Msel Ddel | TaqI 

i ill 

GAAAGAACTACTTAAATCCTAAGCATTCGA 

3121 + + + 3150 

CTTTCTTGATGAATTTAGGATTCGTAAGCT 
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Figure 21A: CPN100626 Coding Sequence 



tcctgaactc 
tgtgacctac 
cactctctcc 
cgtggcgagt 
tctagtgtca 
gttcgggatg 
agagagggat 
aggattttct 
actttctaac 
tgtgtcctgt 
tctcgcgatt 
tagaggaccc 
aactcgtggg 
tttctacttt 
ctgccgcatc 
agggggtgcg 
taagaacaac 
aaataactcc 
aggaaatggt 
aactattctt 
tttgacaatc 
tgctcttatg 
atttcaaaat 
accaaatagc 
tatagaacac 
gggaacgatc 
cattagcagc 
tgcgggatgg 
ggcgagtatt 
catcattaat 
cttgtggatc 
tactttatca 
agatctctct 
tcatatcaat 
tcaaggcatc 
agagacggca 
ggtcaatcct 
tatgttctct 
cttagaaatt 
tccaggattc 
acatcagaaa 
aagcaacaac 
ccaagaggcc 
cgcctacatc 
agcagctatc 
cgttcaggca 
ccgaaagttt 
aaaatggcag 
ggtactctat 
ggatatccta 
tgcgctcttc 
atctacgcac 
attgaaatct 
ctccctcaaa 



cactcgaaat 
aatgctttag 
ttagattact 
agaatgagat 
aagactcctc 
actcctgcag 
gaagagtttc 
ccaataacta 
tacaagtcta 
aataatttct 
gggactggag 
cttatttttt 
ggtgcgattg 
gtcaacaatt 
caaagcaaca 
cttcgtagtg 
tgtgggaaca 
gggtcggtga 
tcaggagggg 
ttcaataata 
aaaaatagtg 
ctcctacagg 
aatgaggttt 
aacttacaac 
caacatccaa 
ttattttctt 
tcgaaaaata 
caattctata 
gcaacaactg 
aaccttgcga 
cgtcctctac 
ggtcctctga 
gagcctttac 
accgataact 
tggtctcctt 
aacaccctct 
gaataccaag 
ctattaagaa 
caagggattg 
cgtatccaat 
atctccttag 
gtctcggctc 
tttgcaacat 
cgtcacatca 
ggctgttctt 
attgcaatac 
gtctctcaaa 
tcaaaattcc 
caacaaaatc 
ggccataact 
cgttctctcg 
catctccaag 
ttagaattaa 
atttaaataa 



tactgattag 
ggatcaaagt 
ctgcggatat 
ttttaacaat 
ctaagttttt 
tgtatagttt 
gtacgagctt 
cgtttgttgg 
ttgataacat 
tattatcaaa 
gcgcgattgc 
tcagcaatcg 
cctgtaatgg 
ccgtcaacaa 
gggcacctct 
aaaatacaac 
atggcggggc 
ttttcaataa 
cgatttatac 
actactgcat 
gccacgtata 
acagcacctg 
tcctcaccac 
ttggagctaa 
ctacaaatcc 
cagcctatat 
cctctgaact 
agttcactca 
ccaactctga 
ttaacctccc 
aatctagtgc 
cactcttaaa 
aaaacattca 
ttcatcctga 
attgggtaga 
acagagctct 
gagatcttgc 
gttataatcg 
ccgacggcct 
ctacagggta 
gttttgcaca 
acaatacagt 
cccacagttt 
agaacagggc 
tcccttggca 
gttctcacca 
agcctttcta 
acgtacctac 
cccaaatcgg 
atgttcgcaa 
atctattctt 
caggaagtac 
caactatccg 



ccaaggtacg 
gaaaaatacc 
ttcttcctcc 
aagtgaccaa 
attctatctc 
acaaacggac 
tcctctctta 
aaatagacat 
ccttcttctt 
tgttgaagac 
ttgccaggga 
aggtcttaac 
agacttcacg 
ctggggagga 
actctttttt 
gatctctgat 
cattcaaaca 
caacacagcg 
aacaaaccta 
tcgcgatggc 
tttcaccaac 
cctactcttc 
atttggtaga 
taaggggtat 
tctaatcttt 
cccagaagct 
tcgcaatggt 
aaaaggaggt 
gactccatca 
ctcgatctta 
tcctttcaca 
tgaggaaaac 
tcttctttct 
aagcttaaat 
gacgataaca 
gtatgccaat 
tacgactccc 
aactggtgat 
ctttgttcat 
ttccttacaa 
gttcttcacc 
ctcttcactt 
agcgtatggc 
agaagggacg 
acagaaatcc 
aacagcgttc 
taatctgacc 
agaatggact 
tgtcacgcta 
tgctttaggg 
ggattaccaa 
cttaaaattc 
atgagctacg 



tggacgacgc 
atgcaggtgt 
acgctgagtc 
aacagaaaga 
gggaatttca 
tcccttgaaa 
gactctctct 
aattcctctc 
tggacatcgg 
catgccttct 
gcctgcacaa 
aatgcgagta 
atttctcaaa 
gccctctcca 
aacaatacag 
aacacgcgtc 
agcgttactg 
ttatctggtt 
tccatagacg 
ggagctatct 
aatcaaggaa 
gcggaacaag 
tacaacgcca 
acgactgctt 
aatcccaatg 
tctgactacg 
gtcctctcta 
atccttaaat 
actagtgtag 
gcaaaaggaa 
gaggacaata 
cgcgatccct 
ttatcggatg 
gcgactgagc 
acaacaaata 
tggactccct 
ctatggcaat 
tctgatatcg 
caaaatagca 
gcatcctccg 
cgcactaaag 
tatgttgagc 
tatggggacc 
tgttatagcc 
tatcttcacc 
gaagagattg 
ttacctctag 
ctagaacttt 
cttgcgagcg 
tacaaagtcc 
ggatcggtct 
taaaataaaa 
ttagcccaat 



aggccactcc 
ttcctaaagt 
actacttaaa 
ttaaggaacc 
cagcctgcat 
agtttgcttt 
ccactcttac 
aagacattgt 
ctgggggagc 
tcagtaaaaa 
tcacgaagaa 
caggaggaga 
atcaagggac 
ccaatggaca 
cccctagtgg 
ctatttattt 
ttgcgataaa 
cgataaattc 
ataaccctgg 
gtacacaatt 
actggggagg 
gaaatatcgc 
tacattgtac 
tttttgatcc 
cgaaccatca 
aaaataattt 
tcgaggatcg 
tagggcatgc 
gctcccaggt 
aagctcctac 
accctacaat 
acgacagtat 
taacagcacg 
attacggtta 
acgcttctat 
taggatataa 
cctttcatac 
agaggccttt 
tccccggggc 
aaacttcttt 
aaatcggatc 
ttccgtggtt 
atcacctcca 
atacattagc 
tcagcccgtt 
gtgacaatcc 
gaatccaagg 
cttaccaacc 
gaggttcctg 
acaatcaaac 
cctcctcgac 
gaacgataaa 
cggtagagga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 

3120 

3180 

3200 
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Figure 21 B: CPN 100626 Deduced Amino Acid Sequence 

Met Gin Val Phe Pro Lys Val Thr Leu Ser Leu Asp Tyr Ser Ala Asp 
15 10 15 

lie Ser Ser Ser Thr Leu Ser His Tyr Leu Asn Val Ala Ser Arg Met 
20 25 30 

Arg Phe Leu Thr lie Ser Asp Gin Asn Arg Lys lie Lys Glu Pro Leu 
35 40 45 

Val Ser Lys Thr Pro Pro Lys Phe Leu Phe Tyr Leu Gly Asn Phe Thr 
50 55 60 

Ala Cys Met Phe Gly Met Thr Pro Ala Val Tyr Ser Leu Gin Thr Asp 
65 70 75 80 

Ser Leu Glu Lys Phe Ala Leu Glu Arg Asp Glu Glu Phe Arg Thr Ser 
85 90 95 

Phe Pro Leu Leu Asp Ser Leu Ser Thr Leu Thr Gly Phe Ser Pro lie 
100 105 110 

Thr Thr Phe Val Gly Asn Arg His Asn Ser Ser Gin Asp lie Val Leu 
115 120 125 

Ser Asn Tyr Lys Ser lie Asp Asn lie Leu Leu Leu Trp Thr Ser Ala 
130 135 140 

Gly Gly Ala Val Ser Cys Asn Asn Phe Leu Leu Ser Asn Val Glu Asp 
145 150 155 160 

His Ala Phe Phe Ser Lys Asn Leu Ala lie Gly Thr Gly Gly Ala lie 
165 170 175 

Ala Cys Gin Gly Ala Cys Thr lie Thr Lys Asn Arg Gly Pro Leu lie 
180 185 190 

Phe Phe Ser Asn Arg Gly Leu Asn Asn Ala Ser Thr Gly Gly Glu Thr 
195 200 205 

Arg Gly Gly Ala lie Ala Cys Asn Gly Asp Phe Thr lie Ser Gin Asn 
210 215 220 

Gin Gly Thr Phe Tyr Phe Val Asn Asn Ser Val Asn Asn Trp Gly Gly 
225 230 235 240 

Ala Leu Ser Thr Asn Gly His Cys Arg lie Gin Ser Asn Arg Ala Pro 
245 250 255 

Leu Leu Phe Phe Asn Asn Thr Ala Pro Ser Gly Gly Gly Ala Leu Arg 
260 265 270 

Ser Glu Asn Thr Thr lie Ser Asp Asn Thr Arg Pro lie Tyr Phe Lys 
275 280 285 
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Fig. 21 B (con't) 



Asn Asn Cys Gly Asn Asn Gly Gly Ala lie Gin Thr Ser Val Thr Val 
290 295 300 

Ala lie Lys Asn Asn Ser Gly Ser Val lie Phe Asn Asn Asn Thr Ala 
305 310 315 320 

Leu Ser Gly Ser lie Asn Ser Gly Asn Gly Ser Gly Gly Ala lie Tyr 
325 330 335 

Thr Thr Asn Leu Ser lie Asp Asp Asn Pro Gly Thr lie Leu Phe Asn 
340 345 350 

Asn Asn Tyr Cys lie Arg Asp Gly Gly Ala lie Cys Thr Gin Phe Leu 
355 360 365 

Thr lie Lys Asn Ser Gly His Val Tyr Phe Thr Asn Asn Gin Gly Asn 
370 375 380 

Trp Gly Gly Ala Leu Met Leu Leu Gin Asp Ser Thr Cys Leu Leu Phe 
385 390 395 400 

Ala Glu Gin Gly Asn lie Ala Phe Gin Asn Asn Glu Val Phe Leu Thr 
405 410 415 

Thr Phe Gly Arg Tyr Asn Ala lie His Cys Thr Pro Asn Ser Asn Leu 
420 425 430 

Gin Leu Gly Ala Asn Lys Gly Tyr Thr Thr Ala Phe Phe Asp Pro lie 
435 440 445 

Glu His Gin His Pro Thr Thr Asn Pro Leu lie Phe Asn Pro Asn Ala 
450 455 460 

Asn His Gin Gly Thr lie Leu Phe Ser Ser Ala Tyr lie Pro Glu Ala 
465 470 475 480 

Ser Asp Tyr Glu Asn Asn Phe lie Ser Ser Ser Lys Asn Thr Ser Glu 
485 490 495 

Leu Arg Asn Gly Val Leu Ser lie Glu Asp Arg Ala Gly Trp Gin Phe 
500 505 510 

Tyr Lys Phe Thr Gin Lys Gly Gly lie Leu Lys Leu Gly His Ala Ala 
515 520 525 

Ser lie Ala Thr Thr Ala Asn Ser Glu Thr Pro Ser Thr Ser Val Gly 
530 535 540 

Ser Gin Val lie lie Asn Asn Leu Ala lie Asn Leu Pro Ser lie Leu 
545 550 555 560 

Ala Lys Gly Lys Ala Pro Thr Leu Trp lie Arg Pro Leu Gin Ser Ser 



565 



570 



575 
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Fig. 21 B (con't) 



Ala Pro Phe Thr Glu Asp Asn Asn Pro Thr lie Thr Leu Ser Gly Pro 
580 585 590 

Leu Thr Leu Leu Asn Glu Glu Asn Arg Asp Pro Tyr Asp Ser lie Asp 
595 600 605 

Leu Ser Glu Pro Leu Gin Asn lie His Leu Leu Ser Leu Ser Asp Val 
610 615 620 

Thr Ala Arg His lie Asn Thr Asp Asn Phe His Pro Glu Ser Leu Asn 
625 630 635 640 

Ala Thr Glu His Tyr Gly Tyr Gin Gly lie Trp Ser Pro Tyr Trp Val 
645 650 655 

Glu Thr lie Thr Thr Thr Asn Asn Ala Ser lie Glu Thr Ala Asn Thr 
660 665 670 

Leu Tyr Arg Ala Leu Tyr Ala Asn Trp Thr Pro Leu Gly Tyr Lys Val 
675 680 685 

Asn Pro Glu Tyr Gin Gly Asp Leu Ala Thr Thr Pro Leu Trp Gin Ser 
690 695 700 

Phe His Thr Met Phe Ser Leu Leu Arg Ser Tyr Asn Arg Thr Gly Asp 
705 710 715 720 

Ser Asp lie Glu Arg Pro Phe Leu Glu lie Gin Gly lie Ala Asp Gly 
725 730 735 

Leu Phe Val His Gin Asn Ser lie Pro Gly Ala Pro Gly Phe Arg lie 
740 745 750 

Gin Ser Thr Gly Tyr Ser Leu Gin Ala Ser Ser Glu Thr Ser Leu His 
755 760 765 

Gin Lys lie Ser Leu Gly Phe Ala Gin Phe Phe Thr Arg Thr Lys Glu 
770 775 780 

lie Gly Ser Ser Asn Asn Val Ser Ala His Asn Thr Val Ser Ser Leu 
785 790 - 795 800 

Tyr Val Glu Leu Pro Trp Phe Gin Glu Ala Phe Ala Thr Ser His Ser 
805 810 815 

Leu Ala Tyr Gly Tyr Gly Asp His His Leu His Ala Tyr lie Arg His 
820 825 830 

lie Lys Asn Arg Ala Glu Gly Thr Cys Tyr Ser His Thr Leu Ala Ala 
835 840 845 

Ala He Gly Cys Ser Phe Pro Trp Gin Gin Lys Ser Tyr Leu His Leu 
850 855 860 
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Fig. 21 B (con't) 

Ser Pro Phe Val Gin Ala lie Ala lie Arg Ser His Gin Thr Ala Phe 
865 870 875 880 

Glu Glu lie Gly Asp Asn Pro Arg Lys Phe Val Ser Gin Lys Pro Phe 
885 890 895 

Tyr Asn Leu Thr Leu Pro Leu Gly lie Gin Gly Lys Trp Gin Ser Lys 
900 905 910 

Phe His Val Pro Thr Glu Trp Thr Leu Glu Leu Ser Tyr Gin Pro Val 
915 920 925 

Leu Tyr Gin Gin Asn Pro Gin lie Gly Val Thr Leu Leu Ala Ser Gly 
930 935 940 

Gly Ser Trp Asp lie Leu Gly His Asn Tyr Val Arg Asn Ala Leu Gly 
945 950 955 960 

Tyr Lys Val His Asn Gin Thr Ala Leu Phe Arg Ser Leu Asp Leu Phe 
' 965 970 975 

Leu Asp Tyr Gin Gly Ser Val Ser Ser Ser Thr Ser Thr His His Leu 
980 985 990 

Gin Ala Gly Ser Thr Leu Lys Phe 
995 1000 
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Figure 22 (RY-45) 

Restriction enzyme analysis of CPN 100626 



MslI 
Cjel 
Hgal 



Hpyl78III 



Tsp509I 
TaqI j Cjel 



BsaJI BsaAI 
Styl Maell| 
CviJI | Rsal | | 
II III 

TCCTGAACTCCACTCGAAATTACTGATTAGCCAAGGTACGTGGACGACGCAGGCCACTCC 

X + + + + + + 60 

AGGACTTGAGGTGAGCTTTAATGACTAATCGGTTCCATGCACCTGCTGCGTCCGGTGAGG 



CviJI 
Hael 
Haelll 
Cac8I | 
I I 



Aarl 
MslI | 

Dpnl BspMI CviRI | j Maelll 

Sau3AI | Alwl | Nlalll | j Tsp45I 

| I I I I III I 

TGTGACCTACAATGCTTTAGGGATCAAAGTGAAAAATACCATGCAGGTGTTTCCTAAAGT 

6i + + + + + + 120 

ACACTGGATGTTACGAAATCCCTAGTTTCACTTTTTATGGTACGTCCACAAAGGATTTCA 



Maelll 
Tsp45I 



Ddel 



BseMII 
Bsp24I | 
MboII CjePI | 
Acil| Cjel | j 
II II I 



Maelll 
Tsp45I 
Hinf I | 
Mnll j 
Ddel | j 
I II 



Maell 
Msel | 
Plel | j 



CACTCTCTCCTTAGATTACTCTGCGGATATTTCTTCCTCCACGCTGAGTCACTACTTAAA 



121 



-+ 180 



GTGAGAGAGGAATCTAATGAGACGCCTATAAAGAAGGAGGTGCGACTCAGTGATGAATTT 



Cjel 
CjePI 
Bsp24I | 



Maelll 

Msel Tsp45I Msel NlalV 

ii ii 

CGTGGCGAGTAGAATGAGATTTTTAACAATAAGTGACCAAAACAGAAAGATTAAGGAACC 

181 + + + + + + 240 

GCACCGCTCATCTTACTCTAAAAATTGTTATTCACTGGTTTTGTCTTTCTAATTCCTTGG 



Hinf I 
Mnll 
Plel | 
BseRI | j 
Bfal | | | 
II 



Ddel 



Apol CviRI 

Tsp509I Cac8I | 

Sthl32I Hpyl78III | Sthl32I j 

Mnll | Aval | | CviJI | j 

II III Ml 



TCTAGTGTCAAAGACTCCTCCTAAGTTTTTATTCTATCTCGGGAATTTCACAGCCTGCAT 

24i + + + + + + 300 

AGATCAC AGTTTCTG AGGAGG ATTCAAAAATAAGATAGAG CC C TT AAAGTGT CGGACGT A 
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Fig. 22 (con't) 



BtsI 

Hpyl7 8III Fokl 
Plel| Pstl 
Nlalll | I CviRI | 

Nspl j|HinfI Sfcl | |TspRI Plel Hinfl Mnll 

I II I I I I I II I 

GTTCGGGATGACTCCTGCAGTGTATAGTTTACAAACGGACTCCCTTGAAAAGTTTGCTTT 



301 



CAAGCCCTACTGAGGACGTCACATATCAAATGTTTGCCTGAGGGAACTTTTCAAACGAAA 



-+ 360 



Hin4I 
Alul 
CviJI 
MboII 
Rsal | 
Fokl | j 

Earl XmnI SunI j | 

I ! I II 

AGAGAGGGATGAAGAGTTTCGTACGAGCTTTCCTCTCTTAGACTCTCTCTCCACTCTTAC 



BplI 
BsaXI 
Hin4I 
Mull 
Hinfl | 
Ddel | | 
Plel | | | 



361 



-+ 420 



TCTCTCCCTACTTCTCAAAGCATGCTCGAAAGGAGAGAATCTGAGAGAGAGGTGAGAATG 



Hpyl78III Rsal 

Cjel Tsp509I Smll | TatI | 

Mmel I Maell Bce83I I Cjel | |MnlI | j 

II I I I I I I I I I 

AGGATTTTCTCCAATAACTACGTTTGTTGGAAATAGACATAATTCCTCTCAAGACATTGT 



421 



-+ 480 



TCCTAAAAGAGGTTATTGATGCAAACAACCTTTATCTGTATTAAGGAGAGTTCTGTAACA 



Bsp24I Alul 
Cjel CviJI CviJI 

Fokl MboII CjePI Acelll | Mwol | 

i i i i i i i 

ACTTTCTAACTACAAGTCTATTGATAACATCCTTCTTCTTTGGACATCGGCTGGGGGAGC 



481 



-+ 540 



TGAAAGATTGATGTTCAGATAACTATTGTAGGAAGAAGAAACCTGTAGCCGACCCCCTCG 



Tsp509I 
Cjel | 
CjePI | | 
Bsp24I | | | 
III I 



MboII 
Nlalll | 
Bbsl | | 

EC057I MboII | j | 
I I I I I 



CjePI 



TGTGTCCTGTAATAATTTCTTATTATCAAATGTTGAAGACCATGCCTTCTTCAGTAAAAA 

541 + + + + + + 600 

ACACAGGACATTATTAAAGAATAATAGTTTACAACTTCTGGTACGGAAGAAGTCATTTTT 
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Fig. 22 (con't) 



Mnll 
Hpyl78III 
CviRI 
Cac8I 
CviJI 
NlalV 
Bpml 
ScrFI 
BsaJl ] 
EcoRII 
Cac8I 
CjePI 
Bsgl 

BsrI BsmFI | 
Tthlllll |HhaI | j 
Mnll | j Thai j j 

I I II I I I 

TCTCGCGATTGGGACTGGAGGCGCGATTGCTTGCCAGGGAGCCTGCACAATCACGAAGAA 



Hpyl7 8III 
Nrul 
Thai 



601 



AGAGCGCTAACCCTGACCTCCGCGCTAACGAACGGTCCCTCGGACGTGTTAGTGCTTCTT 



-+ 660 



MbOlI 

Niaiv 
Avail 
ECO0109I 
PspSII 
Sau96I 
SimI 

TAGAGGACCCCTTATTTTTTTCAGCAATCGAGGTCTTAACAATGCGAGTACAGGAGGAGA 

661 + + + + + + 720 

ATCTCCTGGGGAATAAAAAAAGTCGTTAGCTCCAGAATTGTTACGCTCATGTCCTCCTCT 



Mnll TaqI 



Msel 

i 



Rsal 
Mnll | 
TatI 



BSSSI BseRI BsmAI * Hpyl78III 

II I I 

AACTCGTGGGGGTGCGATTGCCTGTAATGGAGACTTCACGATTTCTCAAAATCAAGGGAC 



721 



-+ 780 



TTGAGCACCCCCACGCTAACGGACATTACCTCTGAAGTGCTAAAGAGTTTTAGTTCCCTG 



Banll 
Bspl286I 
BsrI | 
Bmrl | CviJI j Fokl 
Mnll | |Hin4l| j Mnll 
Hindi | j |NlalV| j BseRI | Bpll 

III I III I I I II I 

TTTCTACTTTGTCAACAATTCCGTCAACAACTGGGGAGGAGCCCTCTCCACCAATGGACA 

781 + + + + + + 840 

AAAGATGAAACAGTTGTTAAGGCAGTTGTTGACCCCTCCTCGGGAGAGGTGGTTACCTGT 



Tsp509I 
Hindi • | 
BsmFI I 



108/165 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



PCT/CA99/00992 



Fig. 22 (corVt) 

TspRI 
Acil | 
Fnu4HI I 
Taul j 
BtsI | j 
II 



SfaNI 
Mwol | 
I 



Bspl286I 
NlalV 
Bmgl | 
BseSX j 
Banl | j 



Msel 
Mnll I 



BslI 
EcoNI 
Bf al | 
Mnll | | 
CviJI | | | 

I III 



841 



CTGCCGCATCCAAAGCAACAGGGCACCTCTACTCTTTTTTAACAATACAGCCCCTAGTGG 

+ + + + + h 

GACGGCGTAGGTTTCGTTGTCCCGTGGAGATGAGAAAAAATTGTTATGTCGGGGATCACC 



900 



Hpyl8 8IX 

Hgal | Thai 

Pflll08I Dpnl | j AflHI | 

Hhal | Sau3AI | j j Mlul j CjePI Msel 

II I I II II II 

AGGGGGTGCGCTTCGTAGTGAAAATACAACGATCTCTGATAACACGCGTCCTATTTATTT 

901 + + + + + + 960 

TCCCCCACGCGAAGCATCACTTTTATGTTGCTAGAGACTATTGTGCGCAGGATAAATAAA 



CviJI 
Haelll 
CjePI | 
Faul Nlaivj 
Sthl32I | Sau96I | | 
RleAI Taal | j Acil | j j 

I I II I III III 

TAAGAACAACTGTGGGAACAATGGCGGGGCCATTCAAACAAGCGTTACTGTTGCGATAAA 

961 + + + + + + 1020 

ATTCTTGTTGACACCCTTGTTACCGCCCCGGTAAGTTTGTTCGCAATGACAACGCTATTT 



Tthlllll 
Maelll Taal I 



SimI Apol 

Neil | Tsp509I 

ScrFI | TaqI | 

Sthl32I Mspl| | HphI Bsbl Drdll | | 

I III II Ml 

AAATAACTCCGGGTCGGTGATTTTCAATAACAACACAGCGTTATCTGGTTCGATAAATTC 

1021 + + + + + + 1080 

TTTATTGAGGCCCAGCCACTAAAAGTTATTGTTGTGTCGCAATAGACCAAGCTATTTAAG 



Hpyl78III 

Drdll | ScrFI 

Mnll | j BsaJI j 

Hpyl78III Xmnl|| j EcoRII | 

I III I ' M 

AGGAAATGGTTCAGG AGGGG C G ATTTATAC AACAAAC C TATC CATAGACG ATAAC C C TGG 

1081 + + + + + + 1140 

TCCTTTACCAAGTCCTCCCCGCTAAATATGTTGTTTGGATAGGTATCTGCTATTGGGACC 
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Fig. 22 (con't) 



Alul 
CviJI 
Ecil 
Acil 



Bed 



Hpyl78III 
BsmI Nrul 
CviRI Thai 



Tsp509I 
Rsal | 
BsrGI | | 
TatI 



AACTATTCTTTTCAATAATAACTACTGCATTCGCGATGGCGGAGCTATCTGTACACAATT 



1141 



■+ 1200 



TTGATAAGAAAAGTTATTATTGATGACGTAAGCGCTACCGCCTCGATAGACATGTGTTAA 



BsaAI 
HphI 
Maell 
CviJI 
Hael 
Haelll 

MSCl 

Eael | 
I I 



BsrI 
Bmrl | 
Mnll | j 
I I I 



1201 



TTTGACAATCAAAAATAGTGGCCACGTATATTTCACCAACAATCAAGGAAACTGGGGAGG 

+ + + + h 

AAACTGTTAGTTTTTATCACCGGTGCATATAAAGTGGTTGTTAGTTCCTTTGACCCCTCC 



1260 



BsiHKAI 
Bspl286I 



Sf cl 



MboII 
Aarl | 
AlwNI j 

II 



Acil 
Earl 
BspMI Thai 
I 



TGCTCTTATGCTCCTACAGGACAGCACCTGCCTACTCTTCGCGGAACAAGGAAATATCGC 



1261 



-+ 1320 



1321 



ACGAGAATACGAGGATGTCCTGTCGTGGACGGATGAGAAGCGCCTTGTTCCTTTATAGCG 

Rsal 

Cjel BsrGI | 

Cjel | Mnll Cjel | | 

Mnll HphI j HgiEIl| Cjel Cjel |TatI | 

I II III I I I I 

ATTTCAAAATAATGAGGTTTTCCTCACCACATTTGGTAGATACAACGCCATACATTGTAC 

+ + + h H h 

TAAAGTTTTATTACTCCAAAAGGAGTGGTGTAAACCATCTATGTTGCGGTATGTAACATG 

Fokl 
Sfcl 
Dpnl | 

Alul BstZ17I Sau3AI | | 

Cjel CviJI Accll Alwl | I I 

I i II I I I I 

ACCAAATAGCAACTTACAACTTGGAGCTAATAAGGGGTATACGACTGCTTTTTTTGATCC 



1380 



1381 



-+ 1440 



TGGTTTATCGTTGAATGTTGAACCTCGATTATTCCCCATATGCTGACGAAAAAAACTAGG 
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Fig. 22 (con't) 



Mmel 

Mnll | Bed 
Msel j Drdll | 

ii ii 

TATAGAACACCAACATCCAACTACAAATCCTCTAATCTTTAATCCCAATGCGAACCATCA 



1441 



1500 



ATATCTTGTGGTTGTAGGTTGATGTTTAGGAGATTAGAAATTAGGGTTACGCTTGGTAGT 



MboII 
Dpnl | 
Sau3AI | | 
Eco57I | j | 
I II 



CviJI 



Pf 111081 
Hpyl8 8IX 
Alul | 
CviJI j 
Hindlll | | |Tsp509I 

i I 



GGGAACGATCTTATTTTCTTCAGCCTATATCCCAGAAGCTTCTGACTACGAAAATAATTT 



1501 



CCCTTGCTAGAATAAAAGAAGTCGGATATAGGGTCTTCGAAGACTGATGCTTTTATTAAA 



-+ 1560 



Taql 
Alul 
CviJI 
Fnu4HI | 
Tsel| j 



Dpnl 
Faul 
Sau3AI 

Hpyl88IX Sthl32l| 
Acelll | Bcgl Mnll | j 

Bbvll Mnll BsrDl| Mnll Taql ||| 

III I II I I Ml 

CATTAGCAGCTCGAAAAATACCTCTGAACTTCGCAATGGTGTCCTCTCTATCGAGGATCG 

1561 + + + + + + 1620 

GTAATCGTCGAGCTTTTTATGGAGACTTGAAGCGTTACCACAGGAGAGATAGCTCCTAGC 

Fnu4HI 
Taul 
Acil | 
Nlalll | 
BciVI Nspl j 

Tsp509I | Sphlj 
Mnll Msel | j Cac8I | j 

I III IN 



Alwl Tsp509I Fokl 
Acil| Bed | Bcgl | 
II III 



1621 



TGCGGGATGGCAATTCTATAAGTTCACTCAAAAAGGAGGTATCCTTAAATTAGGGCATGC 

+ + + + + + 

ACGCCCTACCGTTAAGATATTCAAGTGAGTTTTTCCTCCATAGGAATTTAATCCCGTACG 



1680 



Bed 
Hin4I 
Hinfl 
BseMII Ddel 
BstAPl| Hpyl8 8IX 
CviRI | | Plel | 

Mwol | Mwol|BsmAI | | 
II II I I I 



Bfal 
Spel| 

I I 



ScrFI 
BsaJI 
EcoRII 
NlalV | 
CviJI | | 



GGCGAGTATTGCAACAACTGCCAACTCTGAGACTCCATCAACTAGTGTAGGCTCCCAGGT 

16B1 + + + + + + 1740 

CCGCTCATAACGTTGTTGACGGTTGAGACTCTGAGGTAGTTGATCACATCCGAGGGTCCA 
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Fig. 22 (con't) 



Mnll 
Ddel 
Dpnl 
Mnll 

Sau3AI | j j Alul 
Msel Taql| j I I CviJI 

I I I I 

CATCATTAATAACCTTGCGATTAACCTCCCCTCGATCTTAGCAAAAGGAAAAGCTCCTAC 



Msel 
Vspl 



1741 



-+ 1800 



GTAGTAATTATTGGAACGCTAATTGGAGGGGAGCTAGAATCGTTTTCCTTTTCGAGGATG 



Alwl 
Hin4I | 
Dpnl | | 
NlalV | | Mnll 
BamHI | j | BsiHKAI 
BstYI | j j Bspl286I 
Sau3AI | | j Bfal I 

Alwl Mil I I Tsp509I 

I II I I III I 

CTTGTGGATCCGTCCTCTACAATCTAGTGCTCCTTTCACAGAGGACAATAACCCTACAAT 

1801 + + + + + + I860 

GAACACCTAGGCAGGAGATGTTAGATCACGAGGAAAGTGTCTCCTGTTATTGGGATGTTA 



Hpyl8 8IX 
Avail 
ECOO109I 
PspSII 
Sau96I 
Sse8647I 

I 



Msel 
Mnll | 
Mnll | j 
III 



Pf 111081 
Dpnl 
Sau3AI | 
Thai | | 
Acil | | | 
Alwl | | | | 
II 



BseMII 
Taal I 



TACTTTATCAGGTCCTCTGACACTCTTAAATGAGGAAAACCGCGATCCCTACGACAGTAT 



1861 



-+ 1920 



ATGAAATAGTCCAGGAGACTGTGAGAATTTACTCCTTTTGGCGCTAGGGATGCTGTCATA 



CviJI 
Ddel 
Hpyl8 8IX 
Dpnl 
Bglll | 

BstYI j j j Maelll Fokl 

Sau3AI I | MboII Hpyl88IX | Maell | 

I I I M 

AGATCTCTCTGAGGCTTTACAAAACATTCATCTTCTTTCTTTATCGGATGTAACAGCACG 

192 i + + + + + + 1980 

TCTAGAGAGACTCGGAAATGTTTTGTAAGTAGAAGAAAGAAATAGCCTACATTGTCGTGC 
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Fig. 22 (con't) 



Fokl 



BseMII 
Msel 
Alul 
CviJI 
Hindlll | 
Hpyl78III | I 
I t 



Ddel 



Taal 



TCATATCAATACCGATAACTTTCATCCTGAAAGCTTAAATGCGACTGAGCATTACGGTTA 



1981 



-+ 2040 



AGTATAGTTATGGCTATTGAAAGTAGGACTTTCGAATTTACGCTGACTCGTAATGCCAAT 



BsmAI 

Bsal | Sfcl 

BsmAI | BsmAI] 

SfaNI IBsmBI BsmBI | 

III M 
TCAAGGCATCTGGTCTCCTTATTGGGTAGAGACGATAACAACAACAAATAACGCTTCTAT 



2041 



-+ 2100 



AGTTCCGTAGACCAGAGGAATAACCCATCTCTGCTATTGTTGTTGTTTATTGCGAAGATA 



Banll 
BsiHKAI 
Bspl286I 
CjePI 
SacI 
Alul 
CviJI 
Mnll 
Tthlllll | 
Beef I | | 
Sfcl | II 



Muni 
Plel 
Tsp509I 



BSU36I 



Mwol 



| Hinfl Ddel 



HaelV 
Hin4I 



2101 



AGAGACGGCAAACACCCTCTACAGAGCTCTGTATGCCAATTGGACTCCCTTAGGATATAA 

+ + + + + + 2160 

TCTCTGCCGTTTGTGGGAGATGTCTCGAGACATACGGTTAACCTGAGGGAATCCTATATT 



Dpnl 
Bglll 
BstYI 
Sau3AI j Hinfl 
|Pflll08I | 
[Plel | | 
I I I 

GGTCAATCCTGAATACCAAGGAGATCTTGCTACGACTCCCCTATGGCAATCCTTTCATAC 

2161 + + + + + + 2220 

CCAGTTAGGACTTATGGTTCCTCTAGAACGATGCTGAGGGGATACCGTTAGGAAAGTATG 



Hpyl78III BsaJI 
CjePI | Styl 

I 
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Fig. 22 (con't) 



CviJI 
Hael 
Haelll 
Hpyl78III 
Taql 
HphI 



Msel 
I 



Taql 
I 



EcoRV 
Mnll 
Hpyl88IX 
Hinfl | 
Tf il I 
Bsrl| | 
II I 



Stul 



I 



TATGTTCTCTCTATTAAGAAGTTATAATCGAACTGGTGATTCTGATATCGAGAGGCCTTT 

2221 + + + + + + 2280 

ATACAAGAGAGATAATTCTTCAATATTAGCTTGACCACTAAGACTATAGCTCTCCGGAAA 

NlalV 
Sthl32I 
CviJI 
SfaNI 
Neil 
ScrFI 
Smal 
BsaJI 
Mspl 
Neil 

Sthl32I ScrFI 
Beef I | Avalj 

Mill I | | BsaJI j 

Fokl | j Bpml | BsaJI | j 

| j I | I I II III 

CTTAGAAATTCAAGGGATTGCCGACGGCCTCTTTGTTCATCAAAATAGCATCCCCGGGGC 

2281 + + + + + + 2340 

GAATCTTTAAGTTCCCTAACGGCTGCCGGAGAAACAAGTAGTTTTATCGTAGGGGCCCCG 



Apol 
Tsp509I 
Ddei I 



CviJI 
Haelll 



HaelV 
Hin4I 
Hinfl | 
Tf il j 

ScrFI | | Mnll 
Banll | | | Tthllllll 
Bspl286I j j j BciVI SfaNI | | 

EcoRII Sfcl | Fokl Hpyl88IX | j | 

I I I I I I I I I II 

TCCAGGATTCCGTATCCAATCTACAGGGTATTCCTTACAAGCATCCTCCGAAACTTCTTT 

2341 + + + + + + 2400 

AGGTCCTAAGGCATAGGTTAGATGTCCCATAAGGAATGTTCGTAGGAGGCTTTGAAGAAA 
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Fig. 22 (con't) 



Taal 

HphI | Faul Dpnl 

BSU36I CviRI | | Sthl32l| Sau3AI | 

Hpyl88IX Ddel MboII | j | Acil ||Hpyl88IX| j 

I I II I I III Ml 

ACATCAGAAAATCTCCTTAGGTTTTGCACAGTTCTTCACCCGCACTAAAGAAATCGGATC 

2401 + + + + + + 2460 

TGTAGTCTTTTAGAGGAATCCAAAACGTGTCAAGAAGTGGGCGTGATTTCTTTAGCCTAG 



CviJI 
BsmAI | 

BsmBI j BsaJI 

Tthlllllj CjePI BstDSI NlalV 

Maell || Taal Earl | Alul |DrdIl| 

Alwl | || MboII | BsmAI || CviJI j Mnll | 

II II I I III I I M 

AAGCAACAACGTCTCGGCTCACAATACAGTCTCTTCACTTTATGTTGAGCTTCCGTGGTT 

2461 + + + + + + 

TTCGTTGTTGCAGAGCCGAGTGTTATGTCAGAGAAGTGAAATACAACTCGAAGGCACCAA 



2520 



CviRI 
CjePI 
CviJI 
Hael 
Haelll 
StuI 
Fokl I 



Fokl 
Bed 
NlalV 
Avail | 
CviJI Sau96l| 
Taal RleAI | HphI | j I I BsmFI 

I I I I I II I II 

CCAAGAGGCCTTTGCAACATCCCACAGTTTAGCGTATGGCTATGGGGACCATCACCTCCA 

2521 + + + + + + 2580 

GGTTCTCCGGAAACGTTGTAGGGTGTCAAATCGCATACCGATACCCCTGGTAGTGGAGGT 



Maelll 

Tsp45I Afllll BsmFI Fnu4HI 

Mnll |Hpyl78III Maell CviJI | Tsel | 

III I II II 

CGCCTACATCCGTCACATCAAGAACAGGGCAGAAGGGACGTGTTATAGCCATACATTAGC 

2581 + + + + + + 2640 

GCGGATGTAGGCAGTGTAGTTCTTGTCCCGTCTTCCCTGCACAATATCGGTATGTAATCG 



Mnll 
BscGI 

Alul CviJI 
CviJI Bbvl BbvCI | 

Fnu4HI | CviJI | BsaJI HphI BpulOI j 

Tsel | j Bbvl | | Styl MboII | Ddel | 

II I II I I II II 

AGCAGCTATCGGCTGTTCTTTCCCTTGGCAACAGAAATCCTATCTTCACCTCAGCCCGTT 

2641 + + + + + + 

TCGTCGATAGCCGACAAGAAAGGGAACCGTTGTCTTTAGGATAGAAGTGGAGTCGGGCAA 



2700 
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Fig. 22 (con't) 



Muni 
Tsp509I 
BseMII | 



Cjel 
Mae 1 1 
HphI 



Sthl32I 



| CviRI | | 



I 



I 



I 



Cjel 
MboII 
Maelll 
Tsp4 5I 
Tthlllll 
NspV | 
Taql | 
Earl | | 
I I I 



CGTTCAGGCAATTGCAATACGTTCTCACCAAACAGCGTTCGAAGAGATTGGTGACAATCC 

2701 + + + + + + 27 60 

GCAAGTCCGTTAACGTTATGCAAGAGTGGTTTGTCGCAAGCTTCTCTAACCACTGTTAGG 

BsaJI 
Styl 
Mnll | 
Hinf I 
Tf il 

I BsmAI CviJI Hpyl88IX Hin4I Bfal | 

i ii riii 

CCGAAAGTTTGTCTCTCAAAAGCCTTTCTATAATCTGACCTTACCTCTAGGAATCCAAGG 

+ + + + + + 

GGCTTTCAAACAGAGAGTTTTCGGAAAGATATTAGACTGGAATGGAGATCCTTAGGTTCC 



Sthl32I 
HphI 



2761 



2820 



Sfcl Hpyl78III Mspl 

Rsal I Bfal I BsaWl| 

BsaAI | I Xbal | | BsrFI | 

Maell| I |PleI Hinf I ||| PinAI j 

I II I I I I Ml M 

AAAATGGCAGTCAAAATTCCACGTACCTACAGAATGGACTCTAGAACTTTCTTACCAACC 

2821 + + + + + + 2880 

TTTTACCGTCAGTTTTAAGGTGCATGGATGTCTTACCTGAGATCTTGAAAGAATGGTTGG 



ApOl 
Tsp509I 



ScrFI 
BsaJI 
ECORII 

Nlaiv 
Ppil 
Acil 

Maelll BsrBI 
Tsp45I Cac8I | 

Rsal BslI | Mnll | | 

I II Ml 

GGTACTCTATCAACAAAATCCCCAAATCGGTGTCACGCTACTTGCGAGCGGAGGTTCCTG 

2881 + + + + + + 2940 

CCATGAGATAGTTGTTTTAGGGGTTTAGCCACAGTGCGATGAACGCTCGCCTCCAAGGAC 
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Fig. 22 (con't) 



CviJI 
Hael 
Haelll 
Bf al 
Avrll | 
BsaJI j 
Styl | 

EcoRV | | I MslI BsrDI Rsal 

GGATATCCTAGGCCATAACTATGTTCGCAATGCTTTAGGGTACAAAGTCCACAATCAAAC 



MboII 
I 



2941 



CCTATAGGATCCGGTATTGATACAAGCGTTACGAAATCCCATGTTTCAGGTGTTAGTTTG 



-+ 3000 



Hhal 
I 



Dpnl 
Sau3AI 
Hpyl78III | 
Earl | j 

Sapl Taqlj 
I I 



BseRI 
BslI 
Dpnl 
Sau3AI 
BseRI 
BsaJI | 
Styl | 
Taqll | | 
I I 



Bsal 
BsmAI 
Alwl |TaqI 



TGCGCTCTTCCGTTCTCTCGATCTATTCTTGGATTACCAAGGATCGGTCTCCTCCTCGAC 



3001 



- + 3060 



ACGCGAGAAGGCAAGAGAGCTAGATAAGAACCTAATGGTTCCTAGCCAGAGGAGGAGCTG 



Apol 
Tsp509I 
Tthlllll | 

Mnll Msel| | 

Mnll I BCCI Rsal | | | Tsp509I 

I I I I II I t 

ATCTACGCACCATCTCCAAGCAGGAAGTACCTTAAAATTCTAAAATAAAAGAACGATAAA 



3061 



- + 3120 



TAGATGCGTGGTAGAGGTTCGTCCTTCATGGAATTTTAAGATTTTATTTTCTTGCTATTT 

CviJI 
Mwol 
Mae I I | 

Msel Alul | | 

Tsp509I | Hpyl88IX CviJI | j 
II I I 



Plel 
Mnll | Hinfl 
I I I 



ATTGAAATCTTTAGAATTAACAACTATCCGATGAGCTACGTTAGCCCAATCGGTAGAGGA 



3121 



-+ 3180 



TAACTTTAGAAATCTTAATTGTTGATAGGCTACTCGATGCAATCGGGTTAGCCATCTCCT 
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Fig. 22 (con't) 

Dral 
Mnll 
Swal 
Msel | 
Apol | | 
Tsp509I | | 

I li 
CTCCCTCAAAATTTAAATAA 

3181 + + 3200 

GAGGGAGTTTTAAATTTATT 
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tagacactat aaaacaaatt atagacaaaa aatctagcat tgatttattc agaatatttc 60 



tttctatttg tgaacgagta tgcgcttttt ttgcttcgga atg ttg ctt cct ttt 

Met Leu Leu Pro Phe 
1 5 



115 



act ttt gta ttg get aat gaa ggt etc caa ctt cct ttg gag acc tat 
Thr Phe Val Leu Ala Asn Glu Gly Leu Gin Leu Pro Leu Glu Thr Tyr 
10 15 20 



163 



att aca tta agt cct gaa tat caa gca gec cct caa gta ggg ttt act 211 
lie Thr Leu Ser Pro Glu Tyr Gin Ala Ala Pro Gin Val Gly Phe Thr 
25 30 35 

cat aac caa aat caa gat etc gca att gtc ggg aat cac aat gat ttc 259 
His Asn Gin Asn Gin Asp Leu Ala lie Val Gly Asn His Asn Asp Phe 
40 45 50 

ate ttg gac tat aag tac tat egg teg aat gga ggt get ctt acc tgt 307 
lie Leu Asp Tyr Lys Tyr Tyr Arg Ser Asn Gly Gly Ala Leu Thr Cys 
55 60 65 

aag aat ctt ctg ate tct gaa aat ata ggg aat gtc ttc ttt gag aag 355 
Lys Asn Leu Leu lie Ser Glu Asn lie Gly Asn Val Phe Phe Glu Lys 
70 75 80 85 

aat gtc tgt ccc aat tct ggc ggg gca att tat get get caa aat tgc 403 
Asn Val Cys Pro Asn Ser Gly Gly Ala lie Tyr Ala Ala Gin Asn Cys 
90 95 100 



acg ate tec aag aat cag aac tat gca ttt act aca aac ttg gtc tct 
Thr He Ser Lys Asn Gin Asn Tyr Ala Phe Thr Thr Asn Leu Val Ser 
105 110 115 



gca eta aat teg gac agt tta gga gga ggg att tat agt ggg aac tct 
Ala Leu Asn Ser Asp Ser Leu Gly Gly Gly He Tyr Ser Gly Asn Ser 
185 190 195 



451 



gac aat cct aca gee act gcg gga tea eta ttg ggt gga get etc ttt 499 

Asp Asn Pro Thr Ala Thr Ala Gly Ser Leu Leu Gly Gly Ala Leu Phe 
120 125 130 

gee ata aat tgc tct att act aat aac eta gga cag gga act ttc gtt 547 

Ala He Asn Cys Ser He Thr Asn Asn Leu Gly Gin Gly Thr Phe Val 
135 140 145 

gac aat etc get tta aat aag ggg ggt gee etc tat act gag acg aac 595 

Asp Asn Leu Ala Leu Asn Lys Gly Gly Ala Leu Tyr Thr Glu Thr Asn 

150 155 160 165 

tta tct att aaa gac aat aaa ggc ccg ate ata ate aag cag aat egg 643 

Leu Ser He Lys Asp Asn Lys Gly Pro He He He Lys Gin Asn Arg 
170 175 180 



691 
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Fig. 23 (con't) 

eta aat ata gag gga aat tct gga get ata cag ate aca age aac tct 739 

Leu Asn He Glu Gly Asn Ser Gly Ala He Gin He Thr Ser Asn Ser 

200 205 210 



tat gat ccc ata gaa cat gag etc cct tec tec ttc ccc ata etc ttt 
Tyr Asp Pro He Glu His Glu Leu Pro Ser Ser Phe Pro He Leu Phe 
375 380 385 



835 



883 



tea gga tct ggg gga ggc ata ttt tct ace caa aca etc acg ate tec 787 
Ser Gly Ser Gly Gly Gly He Phe Ser Thr Gin Thr Leu Thr He Ser 
215 220 225 

teg aat aaa aaa etc ata gaa ate agt gaa aat tec gcg ttc gca aat 
Ser Asn Lys Lys Leu He Glu He Ser Glu Asn Ser Ala Phe Ala Asn 
230 235 240 245 

aac tat gga teg aac ttc aat cca gga gga gga ggt ctt act ace ace 
Asn Tyr Gly Ser Asn Phe Asn Pro Gly Gly Gly Gly Leu Thr Thr Thr 
250 255 260 

ttt tgc acg ata ttg aac aac cga gaa ggg gta etc ttt aac aat aac 931 
Phe Cys Thr He Leu Asn Asn Arg Glu Gly Val Leu Phe Asn Asn Asn 
265 270 275 

caa age cag age aac ggt gga gec att cat gcg aaa tct ate att ate 979 
Gin Ser Gin Ser Asn Gly Gly Ala He His Ala Lys Ser He He He 
280 285 290 

aaa gaa aat ggt cct gta tac ttt tta aat aac act gca act egg gga 
Lys Glu Asn Gly Pro Val Tyr Phe Leu Asn Asn Thr Ala Thr Arg Gly 
295 300 305 

ggg get etc etc aac tta tea gca ggt tct gga aac gga age ttc ate 
Gly Ala Leu Leu Asn Leu Ser Ala Gly Ser Gly Asn Gly Ser Phe He 
310 315 320 325 

tta tct gca gat aat gga gat att ate ttt aac aat aat acg gee tec 
Leu Ser Ala Asp Asn Gly Asp He He Phe Asn Asn Asn Thr Ala Ser 
330 335 340 

aag cat gec etc aat cct cca tac aga aac gec att cac teg act cct 1171 
Lys His Ala Leu Asn Pro Pro Tyr Arg Asn Ala He His Ser Thr Pro 
345 350 355 

aat atg aat ctg caa ata gga gee cgt ccc ggc tat cga gtg ctg ttc 1219 
Asn Met Asn Leu Gin He Gly Ala Arg Pro Gly Tyr Arg Val Leu Phe 
360 365 370 



1027 



1075 



1123 



1267 



aat ttc gaa acc ggt cat aca ggt aca gtt tta ttt tea ggg gaa cat 1315 
Asn Phe Glu Thr Gly His Thr Gly Thr Val Leu Phe Ser Gly Glu His 
390 395 400 405 
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gta cac cag aac ttt acc gat gaa atg aat ttc ttt tec tat tta agg 1363 

Val His Gin Asn Phe Thr Asp Glu Met Asn Phe Phe Ser Tyr Leu Arg 

410 415 420 

aac act teg gaa eta cgt caa gga gtc ctt get gtt gaa gat ggt gcg 1411 

Asn Thr Ser Glu Leu Arg Gin Gly Val Leu Ala Val Glu Asp Gly Ala 

425 430 435 

ggg ctg gee tgc tat aag ttc ttc caa cga gga ggc act eta ctt eta 1459 

Gly Leu Ala Cys Tyr Lys Phe Phe Gin Arg Gly Gly Thr Leu Leu Leu 

440 445 450 

ggt caa ggt gcg gtg ate acg aca gca gga acg att ccc aca cca tec 1507 

Gly Gin Gly Ala Val lie Thr Thr Ala Gly Thr . lie Pro Thr Pro Ser 

455 460 465 

tea aca cca acg aca gta gga agt act ata act tta aat cac att gee 1555 

Ser Thr Pro Thr Thr Val Gly Ser Thr lie Thr Leu Asn His lie Ala 

470 475 480 485 

att gac ctt cct tct att ctt tct ttt caa get cag get cca aaa att 1603 

lie Asp Leu Pro Ser lie Leu Ser Phe Gin Ala Gin Ala Pro Lys lie 

490 495 500 

tgg att tac ccc aca aaa aca gga tct acc tat act gaa gat tec aac 1651 

Trp lie Tyr Pro Thr Lys Thr Gly Ser Thr Tyr Thr Glu Asp Ser Asn 

505 510 515 



ccg aca ate aca ate tea gga act etc acc tta cgc aac age aac aac 
Pro Thr lie Thr lie Ser Gly Thr Leu Thr Leu Arg Asn Ser Asn Asn 
520 525 530 



aca tct eta eta ggc gcg aat aca aaa cac aag ctg etc tat gca aac 
Thr Ser Leu Leu Gly Ala Asn Thr Lys His Lys Leu Leu Tyr Ala Asn 
600 605 610 



1699 



gaa gat ccc tac gat agt ctg gat etc teg cac tct ctt gag aaa gtt 1747 
Glu Asp Pro Tyr Asp Ser Leu Asp Leu Ser His Ser Leu Glu Lys Val 
535 540 545 

ccc ctt ctt tat att gtc gat gtc get gca caa aaa att aac tct teg 1795 
Pro Leu Leu Tyr lie Val Asp Val Ala Ala Gin Lys lie Asn Ser Ser 
550 555 560 565 

caa ctg gat eta tec aca tta aat tct ggc gaa cac tat ggg tat caa 1843 
Gin Leu Asp Leu Ser Thr Leu Asn Ser Gly Glu His Tyr Gly Tyr Gin 
570 575 580 

ggc ate tgg teg acc tat tgg gta gaa act aca aca ate acg aac cct 1891 
Gly He Trp Ser Thr Tyr Trp Val Glu Thr Thr Thr He Thr Asn Pro 
585 590 595 



1939 
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tgg tct cct eta ggc tac cgt cct cat ccc gaa cgt cga gga gaa ttc 
Trp Ser Pro Leu Gly Tyr Arg Pro His Pro Glu Arg Arg Gly Glu Phe 
615 620 625 



1987 



att acg aat gec ttg tgg caa teg gca tat acg get ctt gca gga etc 
lie Thr Asn Ala Leu Trp Gin Ser Ala Tyr Thr Ala Leu Ala Gly Leu 
630 635 640 645 



2035 



cac tec etc tec tec tgg gat gaa gag aag ggt cat gca get tec eta 
His Ser Leu Ser Ser Trp Asp Glu Glu Lys Gly His Ala Ala Ser Leu 
650 655 660 



2083 



caa ggc att ggt ctt ctg gtt cat caa aaa gac aaa aac ggt ttt aag 
Gin Gly lie Gly Leu Leu Val His Gin Lys Asp Lys Asn Gly Phe Lys 
665 670 675 



2131 



gga ttt cgt agt cat atg aca ggt tat agt get acc ace gaa gca acc 
Gly Phe Arg Ser His Met Thr Gly Tyr Ser Ala Thr Thr Glu Ala Thr 
680 685 690 



2179 



tct tct caa agt ccg aat ttc tct tta gga ttt get cag ttc ttc tec 
Ser Ser Gin Ser Pro Asn Phe^ Ser Leu Gly Phe Ala Gin Phe Phe Ser 
695 700 705 



2227 



aaa get aaa gaa cat gaa tct caa aat age acg tec tct cac cac tat 
Lys Ala Lys Glu His Glu Ser Gin Asn Ser Thr Ser Ser His His Tyr 
710 715 720 725 



2275 



ttc tct gga atg tgc ata gca aaa tac tct ctt caa aga gtg ata cgt 
Phe Ser Gly Met Cys lie Ala Lys Tyr Ser Leu Gin Arg Val lie Arg 
730 735 740 



2323 



eta tct gtg tct ctt get tat atg ttt acc teg gaa cat acc cat aca 
Leu Ser Val Ser Leu Ala Tyr Met Phe Thr Ser Glu His Thr His Thr 
745 750 755 



2371 



atg tat cag ggt etc ctg gaa ggg aac tct cag gga tct ttc cac aac 
Met Tyr Gin Gly Leu Leu Glu Gly Asn Ser Gin Gly Ser Phe His Asn 
760 765 770 



2419 



cat acc tta gca ggg get etc tec tgt gtt ttc tta cct caa cct cac 
His Thr Leu Ala Gly Ala Leu Ser Cys Val Phe Leu Pro Gin Pro His 
775 780 785 



2467 



ggc gag tec ctg cag ate tat ccc ttt att act gec tta gec ate cga 
Gly Glu Ser Leu Glh lie Tyr Pro Phe lie Thr Ala Leu Ala lie Arg 
790 795 800 805 



2515 



gga aat ctt get gcg ttt caa gaa tct gga gac cat get egg gaa ttt 
Gly Asn Leu Ala Ala Phe Gin Glu Ser Gly Asp His Ala Arg Glu Phe 
810 815 820 



2563 



122/165 



SOBSTITUTE SHEET (RULE 26) 



WO 00/24765 PCT/C A 99/00992 



Fig. 23 (con't) 

tec eta cac cgc ccc eta acg gac gtc tec etc cct gta gga ate cgc 

Ser Leu His Arg Pro Leu Thr Asp Val Ser Leu Pro Val Gly He Arg 

825 830 835 

get tct tgg aag aac cac cac cga gtt ccc eta gtc tgg etc aca gaa 

Ala Ser Trp Lys Asn His His Arg Val Pro Leu Val Trp Leu Thr Glu 

840 845 850 

att tec tat cgc tct act etc tat agg caa gat cct gaa etc cac teg 

He Ser Tyr Arg Ser Thr Leu Tyr Arg Gin Asp Pro Glu Leu His Ser 

855 860 865 



acg ctg agt cac tac tta aac gtg gcg agt aga atg aga ttt 
Thr Leu Ser His Tyr Leu Asn Val Ala Ser Arg Met Arg Phe 
920 925 930 



2611 



2659 



2707 



aaa tta ctg att age caa ggt acg tgg acg acg cag gee act cct gtg 2755 
Lys Leu Leu He Ser Gin Gly Thr Trp Thr Thr Gin Ala Thr Pro Val 
870 875 880 885 

ace tac aat get tta ggg ate aaa gtg aaa aat acc atg cag gtg ttt 2803 
Thr Tyr Asn Ala Leu Gly He Lys Val Lys Asn Thr Met Gin Val Phe 
890 895 900 

cct aaa gtc act etc tec tta gat tac tct gcg gat att tct tec tec 2851 
Pro Lys Val Thr Leu Ser Leu Asp Tyr Ser Ala Asp He Ser Ser Ser 
905 910 915 



2893 



taacaataag tgaccaaaac agaaagatta aggaacctct agtgtcaaag actcctccta 2953 
agtttttatt etatcteggg aatttcacag cctgcatgtt egggatg 3000 
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Figure 24 (RY-46) 

Restriction enzyme analysis of CPN1 00628 



Tsp509I 
I 



HaelV 

Hin4I Bfal 



Hpyl88IX Sspl 



TAGACACTATAAAACAAATTATAGACAAAAAATCTAGCATTGATTTATTCAGAATATTTC 



-+ 60 



ATCTGTGATATTTTGTTTAATATCTGTTTTTTAGATCGTAACTAAATAAGTCTTATAAAG 

Hpyl88IX 
Hhal Mwol | 
I I ! 

TTTCTATTTGTGAACGAGTATGCGCTTTTTTTGCTTCGGAATGTTGCTTCCTTTTACTTT 

61 + + + + + + 120 

AAAGATAAACACTTGCTCATACGCGAAAAAAACGAAGCCTTACAACGAAGGAAAATGAAA 

Bce83I 
Hpyl78III 
Msel | 

Bsal Bsal Cjelj j 

CviJI Cjel BsmAI BsmAI Mmel | j j 

I i I i i ii i 

TGTATTGGCTAATGAAGGTCTCCAACTTCCTTTGGAGACCTATATTACATTAAGTCCTGA 

121 + + + + + + 180 

ACATAACCGATTACTTCCAGAGGTTGAAGGAAACCTCTGGATATAATGTAATTCAGGACT 



Mill I 
Tthlllll 
Bbvl | 

CviJI Bsllj 
Fnu4HI | EcoNI | j 
Tsel | |SmlI | j j 
II I I II 



Muni 
Tsp509I 
Sthl32I 
Dpnl 
Bglll 



BstYI 
Sau3AI 
Hpyl78III | 
II 

ATATCAAGCAGCCCCTCAAGTAGGGTTTACTCATAACCAAAATCAAGATCTCGCAATTGT 

181 + + + + + + 240 

TATAGTTCGTCGGGGAGTTCATCCCAAATGAGTATTGGTTTTAGTTCTAGAGCGTTAACA 



Rsal 

Hinfl Seal BsiEI 

Tfil TatI | Mnll BsiHKAI 

Hpyl78III I Taqll | | TaqI Bspl286I 

I I III I I 

CGGGAATCACAATGATTTCATCTTGGACTATAAGTACTATCGGTCGAATGGAGGTGCTCT 

24i + + + + + + 300 

G CC CTTAGTGTTACTAAAGTAGAACCTG ATATT CATGATAG C C AGCTTAC CTC CACG AGA 
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Hpyl88IX 
Dpnl 
Sau3AI 
Hpyl88IX| 
Hinf I | j 

MboII Tf il j | 



Bbsl 
MboII 



XmnI BsmFI 
I I 

TACCTGTAAGAATCTTCTGATCTCTGAAAATATAGGGAATGTCTTCTTTGAGAAGAATGT 

301 + + + + + + 360 

ATGGACATTCTTAGAAGACTAGAGACTTTTATATCCCTTACAGAAGAAACTCTTCTTACA 



BslI 
Faul 
Sthl32I | 
Tsp509I j 
MboII | j 
I I 



Fnu4HI 
Tsel 
BstAPI 
Tsp509I | 
Bbvl | | 

Acil| j Mwol 

II i I 



Dpnl 
Sau3AI | 
CviRI | | 
Tsp509I | j | 



Hpyl8 8IX 
Hinfl | 
Tf il 



CTGTCCCAATTCTGGCGGGGCAATTTATGCTGCTCAAAATTGCACGATCTCCAAGAATCA 

361 + + + + + + 420 

GACAGGGTTAAGACCGCCCCGTTAAATACGACGAGTTTTAACGTGCTAGAGGTTCTTAGT 



Dpnl 
Sau3AI 
Taqll 
TspRI 
Acil 
BtsI 
AlwNI | 
CjePI 
Cvi JI | 
Bsal Faul | 

Nsil BsmAI Sthl32l|| 

CviRI | Hpyl88IX| Sfcl Ml 

II II I III 

GAACTATGCATTTACTACAAACTTGGTCTCTGACAATCCTACAGCCACTGCGGGATCACT 

421 + + + + + + 480 

CTTGATACGTAAATGATGTTTGAACCAGAGACTGTTAGGATGTCGGTGACGCCCTAGTGA 



Alwl 



Banll 
BsiHKAI 
Bspl286I 
SacI 
Alul | CjePI 
CviJI | Mwol |Tsp509I 
I I II I 



BslI 
EcoNI 
Bfal 
Avrll | 
BsaJI | 
Styl| 
I I 



ATTGGGTGGAGCTCTCTTTGCCATAAATTGCTCTATTACTAATAACCTAGGACAGGGAAC 

481 + + + + + + 540 

TAACCCACCTCGAGAGAAACGGTATTTAACGAGATAATGATTATTGGATCCTGTCCCTTG 
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541 



BsmAI 
BsmBI 
Bspl286I 
BseMII 

Bmgl 
BseSI 

Dral NlalVj | | | Mnll 

Hindi Msel| BanI |DdeI 

I II I M 

TTTCGTTGACAATCTCGCTTTAAATAAGGGGGGTGCCCTCTATACTGAGACGAACTTATC 

+ f. \- H + h 

AAAGCAACTGTTAGAGCGAAATTTATTCCCCCCACGGGAGATATGACTCTGCTTGAATAG 



600 



Hinf I 
Bcgl 
Sthl32I 
BsaBI 



Msel 
I 



Sthl32I 
Dpnl 
Sau3AI 
CviJI | 
Haelll | 
Sau96l| | 
II I 



Hpyl8 8IX 
Apol 



Tsp509I 
Tthiilll 
Bspl286I | 
Bmgl | | 
Tfil BseSI j | 
I I II 



Mnll 
Taal 



TATTAAAGACAATAAAGG CCCG AT CAT AATCAAG CAGAATCGGG CACTAAATT CGGACAG 

601 + -- + + + + + 660 

ATAATTTCTGTTATTTCCGGGCTAGTATTAGTTCGTCTTAGCCCGTGATTTAAGCCTGTC 



Alul 
CviJI 

Bcgl Hpyl78III | 

CjePI | CjePI Apol | | 

Mnll I BseRI Mnll | Tsp509I j | 

mi i ii ill 

TTTAGGAGGAGGGATTTATAGTGGGAACTCTCTAAATATAGAGGGAAATTCTGGAGCTAT 

661 + + + + + + 720 

AAATCCTCCTCCCTAAATATCACCCTTGAGAGATTTATATCTCCCTTTAAGACCTCGATA 



Bpml 
MboII 
Dpnl | 
Eco57I | | 
Sau3AI j j 
I I 



AlwNI 
Mnll 
Dpnl 
Tthiilll 

BstYI 
Sau3AI 
Earl | 
Hpyl78III | 
I I 



Sau3AI 
Hpyl78III | 
BseRI I 



Alwl 

I III 

ACAGATCACAAGCAACTCTTCAGGATCTGGGGGAGGCATATTTTCTACCCAAACACTCAC 

721 + + + + + + 780 

TGTCTAGTGTTCGTTGAGAAGTCCTAGACCCCCTCCGTATAAAAGATGGGTTTGTGAGTG 
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Apol 
Tsp509I Thai 
TspRI Acil | 



781 



TaqI 
Tthlllll | 
Dpnl | I Mnll 

III I III 

GATCTCCTCGAATAAAAAACTCATAGAAATCAGTGAAAATTCCGCGTTCGCAAATAACTA 

. + + + + + 840 



CTAGAGGAGCTTATTTTTTGAGTATCTTTAGTCACTTTTAAGGCGCAAGCGTTTATTGAT 



BslI 
Mnll 
ScrFI 
Mnll 



TaqI EcoRII | 

Dpnl | Mnll | | 

Sau3AI | | Alwl | I j 

ii i i r 



BseRI 
BseRI | 
I I 



CviRI 



TGGATCGAACTTCAATCCAGGAGGAGGAGGTCTTACTACCACCTTTTGCACGATATTGAA 



841 



- + 900 



ACCTAGCTTGAAGTTAGGTCCTCCTCCTCCAGAATGATGGTGGAAAACGTGCTATAACTT 

CviJI 



Taal CviJI 
CjePI | Xcml |NlaIV| Mwol 

III II I I II I 

C AACCGAGAAGGGGTACTCTTTAACAATAACC AAAG C C AGAG CAACGGTGGAG C CATTCA 



BslI Rsal Msel 



901 



-+ 960 



GTTGGCTCTTCCCCATGAGAAATTGTTATTGGTTTCGGTCTCGTTGCCACCTCGGTAAGT 

Aval 
Mnll 
TspRI 
CviRI | 
Dral BtsI | | 

Msel| Sthl32l| j j 
II II I I 

TGCGAAATCTATCATTATCAAAGAAAATGGTCCTGTATACTTTTTAAATAACACTGCAAC 



CjePI 
Nlalll I 



Avail BstZ17I 
Sau96I Accll 



961 



-+ 1020 



ACGCTTTAGATAGTAATAGTTTCTTTTACCAGGACATATGAAAAATTTATTGTGACGTTG 



Banll 
Bspl286I 

CviJI | Hpyl78III Alul 

Hin4l| | BplI | CviJI 

BseRI | | | BspMI Mnll I AlwNI | Hindlll | Sf cl 

I III I I I I I M I 
TCGGGGAGGGGCTCTCCTCAACTTATCAGCAGGTTCTGGAAACGGAAGCTTCATCTTATC 



1021 



1080 



AGCCCCTCCCCGAGAGGAGTTGAATAGTCGTCCAAGACCTTTGCCTTCGAAGTAGAATAG 
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PstI 
CviRI | 
I 



Hin4I 



Msel 



CviJI 
Haelll 



I 



Beef I 
Nlalll 
Nspl 
SphI 
Mnll | 
Cac8l| | 



Tthlllll 



TGCAGATAATGGAGATATTATCTTTAACAATAATACGGCCTCCAAGCATGCCCTCAATCC 

1081 + + + + + + 1140 

ACGTCTATTACCTCTATAATAGAAATTGTTATTATGCCGGAGGTTCGTACGGGAGTTAGG 



Banll 

CviRI BscGI 
Hinfl BsmFI | Bspl286I Mspl 

Taql | Hinfl | j CviJI | Neil 

Mnll Mnll Plel | | Tfil || NlaIV| |ScrFI 

II III IN II I I 

TCCATACAGAAACGCCATTCACTCGACTCCTAATATGAATCTGCAAATAGGAGCCCGTCC 

1141 + + + + + + 

AGGTATGTCTTTGCGGTAAGTGAGCTGAGGATTATACTTAGACGTTTATCCTCGGGCAGG 



1200 



Taql 
Sthl32I | 
CviJI | j 
Sthl32I | j | 
I 



Dpnl 
Sau3AI | 
Alwl | | 

I 



Banll 
BsiHKAI 
Bspl286I 
Sac I 
Alul 
CviJI 
BsaXI 
Nlalll 
Alol | 
Ppil| 
I 



Mnll 
I 



1201 



CGGCTATCGAGTGCTGTTCTATGATCCCATAGAACATGAGCTCCCTTCCTCCTTCCCCAT 

+ + + + + + 1260 

GCCGATAGCTCACGACAAGATACTAGGGTATCTTGTACTCGAGGGAAGGAGGAAGGGGTA 



Rsal 
Nlalll [ 
Nspl ! 
BsrGI | 
Tat I j 

Tsp509I NspV | | Taal AflHI | | 

Msel | Taql jj Rsal | BspLUlll || 

II I II I 1 III 

ACTCTTTAATTTCGAAACCGGTCATACAGGTACAGTTTTATTTTCAGGGGAACATGTACA 



Mspl 
BsaWI I 
BsrFI | 
PinAI j 



1261 



-+ 1320 



TGAGAAATTAAAGCTTTGGCCAGTATGTCCATGTCAAAATAAAAGTCCCCTTGTACATGT 
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Hpyl88IX Maell 



BslI 

Apol Msel | 

Tsp509I EcoNI | j 

I III II 

CCAGAACTTTACCGATGAAATGAATTTCTTTTCCTATTTAAGGAACACTTCGGAACTACG 

1321 + + + + + + 13 8° 

GGTCTTGAAATGGCTACTTTACTTAAAGAAAAGGATAAATTCCTTGTGAAGCCTTGATGC 



MboII 

Cac8I 

CviJI 

Hael 

Haelll 

Cac8I 

Bed Mwol 

Faul | CviJI | 

Sthl32I| j MboII Ml | | I Mnl1 

Hinfl Plel | | |AciI | | M I I I Mnl1 I 

I I II I I I II 

TCAAGGAGTCCTTGCTGTTGAAGATGGTGCGGGGCtGGCCTGCTATAAGTTCTTCCAACG 

1381 + + + + + + 1440 

AGTTCCTCAGGAACGACAACTTCTACCACGCCCCGACCGGACGATATTCAAGAAGGTTGC 



Hpyl78III 
Dpnl | 

Mmel Bell | | Hinfl 

Bfal | Sau3AI j j Tfil 

BseRI | | Acil | j | HphI Fokl | 

III I I I I I II 
AGGAGGCACTCTACTTCTAGGTCAAGGTGCGGTGATCACGACAGCAGGAACGATTCCCAC 



1441 



-+ 1500 



TCCTCCGTGAGATGAAGATCCAGTTCCACGCCACTAGTGCTGTCGTCCTTGCTAAGGGTG 



BsrDI 
CjePI 

Rsal Dral | 

Seal Msel | j 

Taal TatI | Cjel | j | 

I II I II I 

ACCATCCTCAACACCAACGACAGTAGGAAGTACTATAACTTTAAATCACATTGCCATTGA 

1501 + + + + + + 1560 

TGGTAGGAGTTGTGGTTGCTGTCATCCTTCATGATATTGAAATTTAGTGTAACGGTAACT 



Mnl I 

Cjel RleAI | 
Bed | Bsbl | j 
II Ml 



1561 



BstXI 
BseMII | 
BpulOI Apol 
Ddel Tsp509I 
Alul| NlalV | 
CjePI CviJI | CviJI | j 

I II II I 

CCTTCCTTCTATTCTTTCTTTTCAAGCTCAGGCTCCAAAAATTTGGATTTACCCCACAAA 

+ + + + + + 

GGAAGGAAGATAAGAAAGAAAAGTTCGAGTCCGAGGTTTTTAAACCTAAATGGGGTGTTT 



1620 
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Alwl 
RleAl 
Dpnl | 
BstYI | I 
Sau3AI I | 
I I 



Mmel 

Eco57I HphI | 

Hinfl Sthl32l| Hpyl78III | 

Tfil MboII | | Ddel | j 

I III IM 



BseMIl 



1621 



AACAGGATCTACCTATACTGAAGATTCCAACCCGACAATCACAATCTCAGGAACTCTCAC 

+ + + + + + 

TTGTCCTAGATGGATATGACTTCTAAGGTTGGGCTGTTAGTGTTAGAGTCCTTGAGAGTG 



1680 



Dpnl 



BstYI 
Sau3AI 
Hpyl7 8III 
MboII 
Pf 111081 



Dpnl 
BstYI | 
Sau3AI j 
Alwl I 



Hpyl78III 
Alwl Smll | 

I II 



CTTACGCAACAGCAACAACGAAGATCCCTACGATAGTCTGGATCTCTCGCACTCTCTTGA 



1681 



-+ 1740 



GAATGCGTTGTCGTTGTTGCTTCTAGGGATGCTATCAGACCTAGAGAGCGTGAGAGAACT 



Bbvl 
Bsgl | 
Bce83I | | 
Beg I | j | Taql 
II 



Bcgl 

CviRI Msel | 

Fnu4HI | MboII j | 
Tsel| |Tsp509I | j 

IM III 



Earl 



GAAAGTTCCCCTTCTTTATATTGTCGATGTCGCTGCACAAAAAATTAACTCTTCGCAACT 



1741 



CTTTCAAGGGGAAGAAATATAACAGCTACAGCGACGTGTTTTTTAATTGAGAAGCGTTGA 



-+ 1800 



BsrI 
Dpnl 
BstYI | 
Sau3AI 



Hindi 
Sf aNI 

Apol Accl | 

| Tsp509I Taql | 

Alwl Msel | Sail | | 

nil mi 

GGATCTATCCACATTAAATTCTGGCGAACACTATGGGTATCAAGGCATCTGGTCGACCTA 



1801 



CCTAGATAGGTGTAATTTAAGACCGCTTGTGATACCCATAGTTCCGTAGACCAGCTGGAT 

Bbvl 
Hhal | 

Hpyl78III Bfal Thai j 

I III 

TTGGGTAGAAACTACAACAATCACGAACCCTACATCTCTACTAGGCGCGAATACAAAACA 



-+ 1860 



1861 



■+ 1920 



AACCCATCTTTGATGTTGTTAGTGCTTGGGATGTAGAGATGATCCGCGCTTATGTTTTGT 
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Fnu4HI 
Alul | 

CviJI j 
Tsel 



Taal 
Mnll 
CviJI 
Fokl 
Bf al 



BseRI 
CviRI | 
I 



Bsal | 

BsmAI | 

Bsrl | | 
I I 



TaqI 
Sthl32I 
Maell 
Mnll | 
Mnll | | 
Hpyl78III | I I 
I 111 



CAAGCTGCTCTATGCAAACTGGTCTCCTCTAGGCTACCGTCCTCATCCCGAACGTCGAGG 

1921 + + + + + + 1980 

GTTCGACGAGATACGTTTGACCAGAGGAGATCCGATGGCAGGAGTAGGGCTTGCAGCTCC 

BseRI 
Beef I 
Hinf I 
CjePI | 
CviJI CviRI | 
Mwol |PleI | j 



Apol 
EcoRI 
Tsp509I 



BseRI BsmI 
I I 

AGAATTCATTACGAATGCCTTGTGGCAATCGGCATATACGGCTCTTGCAGGACTCCACTC 



1981 



-+ 2040 



TCTTAAGTAATGCTTACGGAACACCGTTAGCCGTATATGCCGAGAACGTCCTGAGGTGAG 



Alul 
CviJI 
Fnu4HI 
CviRI 
Nlalll 
Tsel 
MboII | 
Fokl | | 
SimI j j 



Earl 
Mnll | 
ScrFI | j 
BsaJl| j j CjePI 
BsaXI EcoRII | j j j Mnll | 

llll 



Bbsl 
MboII 
Bbvl | 
I I 



2041 



CCTCTCCTCCTGGGATGAAGAGAAGGGTCATGCAGCTTCCCTACAAGGCATTGGTCTTCT 

+ + + + + + 2100 

GGAGAGGAGGACCCTACTTCTCTTCCCAGTACGTCGAAGGGATGTTCCGTAACCAGAAGA 

Msel 



Taal 



Pflll08I Ndel 



2101 



Drdll 

I II II 

GGTTCATCAAAAAGACAAAAACGGTTTTAAGGGATTTCGTAGTCATATGACAGGTTATAG 

. . , + + + 2160 



2161 



CCAAGTAGTTTTTCTGTTTTTGCCAAAATTCCCTAAAGCATCAGTATACTGTCCAATATC 

Apol 
Tsp509I 
Hpyl88IX | 

Mnll | | Ddel 
MboII Earl | | j MboII | 

I llll M 

TGCTACCACCGAAGCAACCTCTTCTCAAAGTCCGAATTTCTCTTTAGGATTTGCTCAGTT 

+ + + + + + 2220 

ACGATGGTGGCTTCGTTGGAGAAGAGTTTCAGGCTTAAAGAGAAATCCTAAACGAGTCAA 
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Alul Hinfl CjePI 

CviJI Nlalll HphI | Xcml 

BseMII I Tfil Maell | Mnll | 

ii i ii ii 

CTTCTCCAAAGCTAAAGAACATGAATCTCAAAATAGCACGTCCTCTCACCACTATTTCTC 



2221 



■+ 2280 



GAAGAGGTTTCGATTTCTTGTACTTAGAGTTTTATCGTGCAGGAGAGTGGTGATAAAGAG 



Hpyl78III 



MboII 
CjePI ] 

CviRI I Earl Maell BsmAI 

ill ii i 

TGGAATGTGCATAGCAAAATACTCTCTTCAAAGAGTGATACGTCTATCTGTGTCTCTTGC 



2281 



-+ 2340 



ACCTTACACGTATCGTTTTATGAGAGAAGTTTCTCACTATGCAGATAGACACAGAGAACG 



Cjel 
Bsal 
BsmAI 

Cjel ScrFI 
Hpyl88IX| EcoRII | 

BsaJI I Mnll SimI | j I Ddel 

i ii i in 

TTATATGTTTACCTCGGAACATACCCATACAATGTATCAGGGTCTCCTGGAAGGGAACTC 



2341 



-+ 2400 



AATATACAAATGGAGCCTTGTATGGGTATGTTACATAGTCCCAGAGGACCTTCCCTTGAG 



2401 



Banll 
Bspl286I 
BpulOI CviJI | 

Ddel BslI | j BsmFI 

I I I I ! 

TCAGGGATCTTTCCACAACCATACCTTAGCAGGGGCTCTCTCCTGTGTTTTCTTACCTCA 

+ + + + + + 2460 

AGTCCCTAGAAAGGTGTTGGTATGGAATCGTCCCCGAGAGAGGACACAAAAGAATGGAGT 



Dpnl 
BstYI I BseMII 
Sau3AI | Alwl | 



Mnll 



Dpnl 
Beef I 
Bglll 
BstYI 
PstI 
Sau3AI 
Mnll CviRI | 
Hinfl | Plel j 
Sfcl | j 
I I 



I 



Fokl 



Bbvl 
BsaJI 
Hpyl88IX 
Bed 
CviJI 
Mnll 
BpulOI | 
Ddel I 



2461 



ACCTCACGGCGAGTCCCTGCAGATCTATCCCTTTATTACTGCCTTAGCCATCCGAGGAAA 

+ + + + + + 2520 

TGGAGTGCCGCTCAGGGACGTCTAGATAGGGAAATAATGACGGAATCGGTAGGCTCCTTT 
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Sthl32I 
Hpyl78III 

Bsal 
BsmAI 
Hinf I | 
Tf il 



2521 



Apol 
Tsp509I 
Hpyl78III 
BslI | 

Fmi4HI Tfil| | | Aval| j 

|NlaIII | | | |BpmI Acil 
111 I 

TCTTGCTGCGTTTCAAGAATCTGGAGACCATGCTCGGGAATTTTCCCTACACCGCCCCCT 

+ + + + + + 2580 

AGAACGACGCAAAGTTCTTAGACCTCTGGTACGAGCCCTTAAAAGGGATGTGGCGGGGGA 



Tsel |Hpyl78III | 
II 



BslI 



2581 



BsmAI 
BsmBI 
Aatll 

BsaHI | | Hinf I Hhal 

Maell I | Tfil Thai | 

Sfcl Mnll|AciI | j Drdll MboII 

... I II I I I I I 

AACGGACGTCTCCCTCCCTGTAGGAATCCGCGCTTCTTGGAAGAACCACCACCGAGTTCC 

+ + + + + + 2640 

TTGCCTGCAGAGGGAGGGACATCCTTAGGCGCGAAGAACCTTCTTGGTGGTGGCTCAAGG 

Hpyl78III 
Dpnl 
BstYI 
Sau3AI 
Alwl | 
Sfcl | j 



CviJI 
BslI | Apol 
Bfal | j Tsp509I 



2641 



CCTAGTCTGGCTCACAGAAATTTCCTATCGCTCTACTCTCTATAGGCAAGATCCTGAACT 

+ + + + + + 2700 

GGATCAGACCGAGTGTCTTTAAAGGATAGCGAGATGAGAGATATCCGTTCTAGGACTTGA 

Maelll 
Tsp45I 
MslI 
Cjel 
Hgal 



Tsp509I 
TagI | Cjel 
I 



BsaJI BsaAI 
Styl Maell[ 



CviJI I 
I 



Rsal | | 
I I 



CviJI 
Hael 
Haelll 
Cac8I | 



2701 



CCACTCGAAATTACTGATTAGCCAAGGTACGTGGACGACGCAGGCCACTCCTGTGACCTA 

+ + + + + + 2760 

GGTGAGCTTTAATGACTAATCGGTTCCATGCACCTGCTGCGTCCGGTGAGGACACTGGAT 
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Dpnl 
Sau3AI | 
I I 



BspMI 
Alwl | 



Aarl 
MslI | 
CviRI | | 
Nlalllj | 
I 



Maelll 
Tsp45I 
I 



CAATGCTTTAGGGATCAAAGTGAAAAATACCATGCAGGTGTTTCCTAAAGTCACTCTCTC 



2761 



GTTACGAAATCCCTAGTTTCACTTTTTATGGTACGTCCACAAAGGATTTCAGTGAGAGAG 



-+ 2820 



Ddel 



BseMII 
Bsp24I | 
MboII CjePI | 
Acil | Cjel | I 
I 



Maelll 
Tsp45I 
Hinf I | 
Mnll I 
Ddel | I 
I 



Cjel 
CjePI 
Bsp24I 
Maell | 
Msel | | 
Plel | | | 
II II 



CTTAGATTACTCTGCGGATATTTCTTCCTCCACGCTGAGTCACTACTTAAACGTGGCGAG 



2821 



-+ 2880 



GAATCTAATGAGACGCCTATAAAGAAGGAGGTGCGACTCAGTGATGAATTTGCACCGCTC 

Mnll 
Plel | 

Maelll BseRl| j 

Msel Tsp45I Msel NlalV Bfal | | | 

I I I I I IM 

TAGAATGAGATTTTAACAATAAGTGACCAAAACAGAAAGATTAAGGAACCTCTAGTGTCA 



2881 



-+ 2940 



ATCTTACTCTAAAATTGTTATTCACTGGTTTTGTCTTTCTAATTCCTTGGAGATCACAGT 



Apol 
Tsp509I 
Sthl32I Hpyl78III | 
Ddel Mnll | Aval | | 



Hpyl78III 
Nlalll 
CviRI | 
Cac8I | | 
Sthl32I | | 
CviJI | |NspI 



Hinf I 

AAGACTCCTCCTAAGTTTTTATTCTATCTCGGGAATTTCACAGCCTGCATGTTCGGGATG 



2941 



- + 3000 



TTCTGAGGAGGATTCAAAAATAAGATAGAGCCCTTAAAGTGTCGGACGTACAAGCCCTAC 



134/165 
SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 



PCT/CA99/00992 



Figure 25: 

cactgtggat gtgatattcg cagaacctcc cgtcaaatat actctagata taggaagcaa 60 



attacgactt taaaccttat ttaacgacag ggttgaggc atg cct ctt tct ttc 

Met Pro Leu Ser Phe 
1 5 



114 



aaa 
Lys 


tct 
Ser 


tea 
Ser 


tct 
Ser 


ttt 
Phe 
10 


tQt 

w 3 

Cys 


eta 
Leu 


ctt 
Leu 


gee 
Ala 


tgt 
Cys 
15 


tta 
Leu 


tgt 
Cys 


agt 
Ser 


gca 
Ala 


agt 
Ser 
20 


tqc 

w 3 w 

Cys 


162 


aces 
y *~y 

Ala 


ttt 
Phe 


get 
Ala 


aaa 

y °.y 

Glu 
25 


act 
Thr 


aqa 
Arg 


etc 
Leu 


qqa 

Gly 


qqq 
333 

Gly 
30 


aac 
Asn 


ttt 
Phe 


Qtt 

Val 


cct 
Pro 


cca 
Pro 
35 


att 
He 


acq 
Thr 


210 


aat 
Asn 


caa 
Gin 


yy 
Gly 

40 


gaa 
Glu 


y a j 
Glu 


ate 
He 


tta 
Leu 


etc 
Leu 
45 


act 
Thr 


tea 
Ser 


aat 
Asp 


ttt 

Phe 


gtt 
Val 
50 


tat 

Cys 


tea 
Ser 


aac 
Asn 


25 8 


1-1- r 

Phe 


1- 1-er 

Leu 
55 


yyy 
Gly 


arc 

Ala 


agt 

Ser 


ttt 
Phe 


tea 
Ser 
60 


aat 
Ser 


tec 
Ser 


ttt 
Phe 


ate 
He 


aat 
Asn 
65 


agt 
Ser 


tec 
Ser 


age 
Ser 


aat 
Asn 


3 06 


etc 
Leu 
70 


tec 
Ser 


t " a 

Leu 


tta 
Leu 


acc 

y yy 
Gly 


aag 
Lys 
75 


acc 
Gly 


ctt 
Leu 


tec 
Ser 


tta 
Leu 


a *-3 

Thr 
80 


ttt 
Phe 


acc 
Thr 


tct 
Ser 


ten 

Cys 


caa 
Gin 
85 


2 5 4 


get 
Ala 


ee <~ 
Pro 


aca 
Thr 


aat 
Asn 


agt 
Ser 
90 


aac 

Asn 


tat 
Tyr 


aca 

y <~y 
Ala 


eta 
Leu 


ctt 
Leu 
95 


tct 

Ser 


gee 
Ala 


gca 
Ala 


y ^y 
Glu 


act 
Thr 
100 


c tg 

Leu 


4 02 


Thr 


ttc 
Phe 


aag 

Lys 


aat 
Asn 
105 
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Fig. 25 (con't) 

agt tct ccc acg gca gtc gtt aaa ttc ate aat aac acc get acc atg 738 
Ser Ser Pro Thr Ala Val Val Lys Phe lie Asn Asn Thr Ala Thr Met 
Ser Ser Pro Thr Ala Val Val Lys Phe lie Asn Asn Thr Ala Thr Met 

200 205 210 

age ttc tec cat aac ttt act teg tea gga ggc ggc gtg att tat gga 786 
Ser Phe Ser His Asn Phe Thr Ser Ser Gly Gly Gly Val lie Tyr Gly 
Ser Phe Ser His Asn Phe Thr Ser Ser Gly Gly Gly Val lie Tvr Glv 
215 220 225 

gga age tct etc ctt ttt gaa aac aat tct gga tgc ate ate ttc acc 834 
Gly Ser Ser Leu Leu Phe Glu Asn Asn Ser Gly Cys lie lie Phe Thr 
Gly Ser Ser Leu Leu Phe Glu Asn Asn Ser Gly Cys lie lie Phe Thr 
230 235 _ * 240 2 45 

gee aac tec tgt gtg aac age tta aaa ggc gtc acc cct tea tea gga 882 

Ala Asn Ser Cys Val Asn Ser Leu Lys Gly Val Thr Pro Ser Ser Gly 

Ala Asn Ser Cys Val Asn Ser Leu Lys Gly Val Thr Pro Ser Se*- Glv 
250 255 260 

acc tat get tta gga agt ggc gga gec ate tgc ate cct acg gga ac*- 9*0 
Thr Tyr Ala Leu Gly Ser Gly Gly Ala He Cvs He Pro Thr Gly Th- 
Thr Tyr Ala Leu Gly Ser Gly Gly Ala He Cvs He Pro Thr Gly Th- 
265 270 ' 275 

ttc gaa tta aaa aac aat cag ggg aag tgc acc ttc tct tat aat ggt 978 

Phe Glu Leu Lys Asn Asn Gin Gly Lys Cys Thr Phe Ser Tvr Asn Glv 

Phe Glu Leu Lys Asn Asn Gin Gly Lys Cys Thr Phe Ser Ty^ \sn Glv 

280 285 290 

aca cca aat gat gcg ggt gcg ate tac gee gaa ace tgc aac ate gta 1026 

Thr Pro Asn Asp Ala Gly Ala He Tyr Ala Glu Thr Cys Asn lie Val 

Thr Pro Asn Asp Ala Gly Ala He Tyr Ala Glu Thr Cys Asn He Val 
295 300 305 

ggg aac cag ggt gee ttg etc eta gat age aac act gca gcg aga aat 1074 

Gly Asn Gin Gly Ala Leu Leu Leu Asp Ser Asn Thr Ala Ala Arg Asn 

Gly Asn Gin Gly Ala Leu Leu Leu Asp Ser Asn Thr Ala Ala Arg Asn 
310 315 320 325 

ggc gga gee ate tgt get aaa gtg etc aat att caa gga cgc ggt cct 1122 
Gly Gly Ala lie Cys Ala Lys Val Leu Asn lie Gin Gly Arg Gly Pro 
Gly Gly Ala He Cys Ala Lys Val Leu Asn He Gin Gly Arg Gly P-o 
330 335 340 

att gaa ttc tct aga aac cgc gcg gag aag ggt gga get att ttc ata 1170 

He Glu Phe Ser Arg Asn Arg Ala Glu Lys Gly Gly Ala lie Phe He 

He Glu Phe Ser Arg Asn Arg Ala Glu Lys Gly Gly Ala He Phe He 
345 350 355 

ggc ccc tct gtt gga gac cct gcg aag caa aca teg aca ctt acg att 12-3 

Gly Pro Ser Val Gly Asp Pro Ala Lys Gin Thr Ser Thr Leu Thr He 

Gly Pro Ser Val Gly Asp Pro Ala Lys Gin Thr Ser Thr Leu Thr lie 

360 •* 365 370 

ttg get tec gaa ggt gat att gcg ttc caa gga aac atg etc aat aca 1266 

Leu Ala Ser Glu Gly Asp He Ala Phe Gin Gly Asn Met Leu Asn Thr 

Leu Ala Ser Glu Gly Asp He Ala Phe Gin Gly Asn Met Leu Asn Thr 

375 380 385 
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Fig. 25 (con't) 

aaa cct gga ate cgc aat gec ate act gta gaa gca ggg gga gag att 1314 

Lys Pro Gly lie Arg Asn Ala lie Thr Val Glu Ala Gly Gly Glu lie 

Lys Pro Gly lie Arg Asn Ala lie Thr Val Glu Ala Gly Gly Glu lie 
390 395 400 405 

gtg tct eta tct gca caa gga ggc tea cgt ctt gta ttt tat gat ccc 1362 
Val Ser Leu Ser Ala Gin Gly Gly Ser Arg Leu Val Phe Tyr Asp Pro 
Val Ser Leu Ser Ala Gin Gly Gly Ser Arg Leu Val Phe Tyr Asp Pro 

410 415 420 

att aca cat age etc cca ace aca agt ccg tct aat aaa gac act aca 1410 

lie Thr His Ser Leu Pro Thr Thr Ser Pro Ser Asn Lys Asp lie Thr 

lie Thr His Ser Leu Pro Thr Thr Ser Pro Ser Asn Lys Asd lie Thr 

425 430 435 

ate aac get aat ggc get tea gga tct gta gtc ttt aca agt aag gga 1458 

lie Asn Ala Asn Gly Ala Ser Gly Ser Val Val Phe Thr Ser Lys Gly 

lie Asn Ala Asn Gly Ala Ser Gly Ser Val Val Phe Thr Ser Lys Gly 

440 445 450 

etc tec tct aca gaa etc erg ttg cct gee aac acg aca act ata ctt 1506 

Leu Ser Ser Thr Glu Leu Leu Leu Pro Ala Asn Thr Thr Thr lie Leu 

Leu Ser Ser Thr Glu Leu Leu Leu Pro Ala Asn Thr Thr Thr lie Leu 

455 460 465 

eta gga aca gtc aag ate get agt gga gaa ctg aag att act gac aat 1554 

Leu Gly Thr Val Lys lie Ala Ser Gly Glu Leu Lys lie Thr Asp Asn 

Leu Gly Thr Val Lys lie Ala Ser Gly Glu Leu Lys lie Thr Asd Asn 

470 475 480 * 485 

gcg gtt gtc aat gtt get ggc ttc get act cag ggc tea ggt cag ctt 1602 

Ala Val Val Asn Val Ala Gly Phe Ala Thr Gin Gly Ser Gly Gin Leu 

Ala Val Val Asn Val Ala Gly Phe Ala Thr Gin Gly Ser Gly Gin Leu 

490 495 500 

ace ctg ggc tct gga gga ace tta ggg ctg gca aca ccc acg gga gca 1650 

Thr Leu Gly Ser Gly Giy Thr Leu Gly Leu Ala Thr Pro Thr Gly Ala 

Thr Leu Gly Ser Gly Gly Thr Leu Gly Leu Ala Thr Pro Thr Gly Ala 
505 510 515 

cct gee get gta gac ttt acg att gga aag tta gca ttc gat cct ttt 1698 

Pro Ala Ala Val Asp Phe Thr lie Gly Lys Leu Ala Phe Asp Pro Phe 

Pro Ala Ala Val Asp Phe Thr lie Gly Lys Leu Ala Phe Asp Pro Phe 

520 525 530 

tee ttc eta aaa aga gat ttt gtt tea gca tea gta aat gca ggc aca 174 6 

Ser Phe Leu Lys Arg Asp Phe Val Ser Ala Ser Val Asn Ala Gly Thr 

Ser Phe Leu Lys Arg Aso Phe Val Ser Ala Ser Val Asn Ala Giy Thr 
535 540 545 

aaa aac gtc act tta aca gga get ctg gtt ctt gat gaa cat gac gtt 17 94 

Lys Asn Val Thr Leu Thr Gly Ala Leu Val Leu Asp Glu His Asp Val 

Lys Asn Val Thr Leu Thr Gly Ala Leu Val Leu Asp Glu His Aso Val 

550 555 ' 560 * 565 

aca gat ctt tat gat atg gtg tea tta caa tct cca gta gca att cct 18 4 2 

Thr Asp Leu Tyr Asp Met Val Ser Leu Gin Ser Pro Val Ala lie Pro 

Thr Asp Leu Tyr Asp Met Val Ser Leu Gin Ser Pro Val Ala lie Pro 
570 575 530 
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Fig. 25 (con't) 
ate get g 
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Gly Glu lie Ala Thr Pro Ser His Tyr Gly Tyr Gin Gly Lys Trp Ser 

Gly Glu lie Ala Thr Pro Ser His Tyr Gly Tyr Gin Gly Lys Trp Ser 
600 605 610 

tac aca tgg tec cgt ccc ctg tta att cca get cct gat gga gga ttt 198 6 
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2034 



2082 



2130 



gca ttc tct gat att etc caa gat gtt ctt ttg ata gat cat ccc ggg 2178 

Ala Phe Ser Asp lie Leu Gin Asp Val Leu Leu He Asp His Pro Gly 

Ala Phe Ser Asd He Leu Gin Asp Val Leu Leu He Asp His Pro Gly 

680 * 685 690 



2226 



2274 



710 

gcg eta tct atg aac tac acg gac cac act acg tta gga ctt tct ttc 2322 
Ala Leu Ser Met Asn Tyr Thr Asp His Thr Thr Leu Gly Leu Ser Phe 
Ala Leu Ser Met Asn Tvr Thr Aso His Thr Thr Leu Gly Leu Ser Phe 
730 " 735 740 

2370 



2418 



138/165 



SUBSTITUTE SHEET (RULE 26) 



WO 00/24765 PCT/CA99/00992 



Fig. 25 (con't) 

act caa aag age gag gec tta att tec tgg aaa gca get tat ggt tat 24 66 

Thr Gin Lys Ser Glu Ala Leu lie Ser Trp Lys Ala Ala Tyr Gly Tyr 

Thr Gin Lys Ser Glu Ala Leu lie Ser Trp Lys Ala Ala Tyr Gly Tyr 
775 780 785 

tec aaa aat cac eta aat ace ace tac etc aga cct gac aaa get cca 2514 

Ser Lys Asn His Leu Asn Thr Thr Tyr Leu Arg Pro Asp Lys Ala Pro 

Ser Lys Asn His Leu Asn Thr Thr Tyr Leu Arg Pro Asp Lys Ala Pro 

790 795 800 ~ 805 

aaa tct caa ggg caa tgg cat aac aat agt tac tat gtt ctt att tct 25 62 

Lys Ser Gin Gly Gin Trp His Asn Asn Ser Tyr Tyr Val Leu lie Ser 

Lys Ser Gin Gly Gin Trp His Asn Asn Ser Tyr Tyr Val Leu He Ser 

810 815 820 

gca gaa cat cct ttc eta aac tgg tgt ctt ctt aca aga cct ctg get 2610 

Ala Glu His Pro Phe Leu Asn Trp Cys Leu Leu Thr Arg Pro Leu Ala 

Ala Glu His Pro Phe Leu Asn Trp Cys Leu Leu Thr Arg Pro Leu Ala 
825 830 835 

caa get tgg gat ctt tea ggt ttt att tec gca gaa ttc eta ggt ggt 2653 

Gin Ala Trp Asp Leu Ser Gly Phe He Ser Ala Glu Phe Leu Gly Gly 

Gin Ala Trp Asp Leu Ser Gly Phe lie Ser Ala Glu Phe Leu Glv Gly 

840 845 850 

tgg caa agt aag ttc aca gaa act gga gat ctg caa cgt age ttt agt 2706 

Trp Gin Ser Lys Phe Thr Glu Thr Gly Asp Leu Gin Arg Ser Phe Ser 

Trp Gin Ser Lys Phe Thr Glu Thr Gly Asp Leu Gin Arg Ser Phe Ser 

855 ~ 860 " 865 

aga ggt aaa ggg tac aat gtt tec eta ccg ata gga tgt tct tct caa 2754 
Arg Gly Lys Gly Tyr Asn Val Ser Leu Pro He Gly Cys Ser Ser Gin 
Arg Gly Lys Gly Tyr Asn Val Ser Leu Pro He Gly Cys Ser Ser Gin 
870 875 880 885 

tgg ttc aca cca ttt aag aag get cct tct aca ctg ace ate aaa ctt 2802 

Trp Phe Thr Pro Phe Lys Lys Ala Pro Ser Thr Leu Thr He Lys Leu 

Trp Phe Thr Pro Phe Lys Lys Ala Pro Ser Thr Leu Thr He Lys Leu 

890 895 900 

gee tac aag cct gat ate tat cgt gtc aac cct cac aat att gtg act 2850 

Ala Tyr Lys Pro Asp lie Tyr Arg Val Asn Pro His Asn lie Val Thr 

Ala Tyr Lys Pro Aso lie Tyr Arg Val Asn Pro His Asn He Val Thr 

905 910 915 

gtc gtc tea aac caa gag age act teg ate tea gga gca aat eta cgc 2398 

Val Val Ser Asn Gin Glu Ser Thr Ser He Ser Gly Ala Asn Leu Arg 

Val Val Ser Asn Gin Glu Ser Thr Ser He Ser Gly Ala Asn Leu Arg 

920 925 930 

cgc cac gg-c ttg ttt gta caa arc cat gat gta gta gat czc ace gag 294 6 

Arg His Gly Leu Phe Val Gin He His Asp Val Val Asp Leu Thr Glu 

Arg His Gly Leu Phe Val Gin He His Aso Val Val Aso Leu Thr Glu 

935 * * 940 " 945 
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Fig. 25 (con't) 

gac act cag gcc ttt eta aac tat acc ttt gac ggg aaa aat gga ttt 2994 

Asp Thr Gin Ala Phe Leu Asn Tyr Thr Phe Asp Gly Lys Asn Gly Phe 

Asp Thr Gin Ala Phe Leu Asn Tyr Thr Phe Asp Gly Lys Asn Gly Phe 
950 955 960 965 

aca aac cac cga gtg tct aca gga eta aaa tec aca ttt taaaactcta 3043 

Thr Asn His Arg Val Ser Thr Gly Leu Lys Ser Thr Phe 

Thr Asn His Arg Val Ser Thr Gly Leu Lys Ser Thr Phe 
970 975 

agetctgett agagttttct gtagccccgg tegtcttaga atcctctatc catcatcgaa 3103 

gaacttagca atgaaggesa agattctcac tctatgagaa ccccccc 3150 
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Figure 26 (RY-47) 

Restriction enzyme analysis of CPN1 00630 



TspRI 
Taal | 
I 



Hpyl78III 
Mnll I 
Sthl32I Bfalj 
BscGI | Xbal | | 

I ! Ill 



Fokl BscGI I Xbal I i Tsp509I 

I 

CACTGTGGATGTGATATTCGCAGAACCTCCCGTCAAATATACTCTAGATATAGGAAGCAA 



GTGACACCTACACTATAAGCGTCTTGGAGGGCAGTTTATATGAGATCTATATCCTTCGTT 



-+ 60 



Nlalll 
Nspl 
SphI 
Cac8I I 



Mnll 
MboII I 



Dral 

Msel | Msel Mnll 

II II II II 

ATTACGATTTTAAACCTTATTTAACGACAGGGTTGAGGCATGCCTCTTTCTTTCAAATCT 



61 



-+ 120 



TAATGCTAAAATTTGGAATAAATTGCTGTCCCAACTCCGTACGGAGAAAGAAAGTTTAGA 



BsmAI 
Hhal 
Thai 
BseMII | 

AccI CviRI Mwol | | 

I I II I 

TCATCTTTTTGTCTACTTGCCTGTTTATGTAGTGCAAGTTGCGCGTTTGCTGAGACTAGA 



Hinf I 
Mnll 
Bf al | 
Plel | | 
Ddel | | | 



121 



- + 180 



AGTAGAAAAACAGATGAACGGACAAATACATCACGTTCAACGCGCAAACGACTCTGATCT 

Hpyl88IX 
Tthlllll 
MboII 
HphI 
Dpnl 
Bglll 
BstYI 

Hinfl Sau3AI 

Tfil Eco57I | 

Hpyl88IX Tsp509I Mnll | Earl | | 

I Mill 
CTCGGAGGGAACTTTGTTCCTCCAATTACGAATCAGGGTGAAGAGATCTTACTCACTTCA 

+ + + + + + 

GAGCCTCCCTTGAAACAAGGAGGTTAATGCTTAGTCCCACTTCTCTAGAATGAGTGAAGT 

GATTTTGTTTGTTCAAACTTCTTGGGGGCGAGTTTTTCAAGTTCCTTTATCAATAGTTCC 



181 



240 



241 



-+ 300 



CTAAAACAAACAAGTTTGAAGAACCCCCGCTCAAAAAGTTCAAGGAAATAGTTATCAAGG 
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Fig. 26 (con't) 



CviJI 
Haelll 
ECOO109I | 
BslI Sau96I j 
I 



Acll 
Maell 
Msel | 
I 



Alul 
CviJI 
Mnll | 
I I 



AGCAATCTCTCCTTATTAGGGAAGGGCCTTTCCTTAACGTTTACCTCTTGTCAAGCTCCT 

301 + + + + + + 360 

TCGTTAGAGAGGAATAATCCCTTCCCGGAAAGGAATTGCAAATGGAGAACAGTTCGAGGA 



Plel 

Acil| Apol 

Fnu4HI | Tsp509I 

Taulj Hpyl88IX Mboll] 

Maelll Hhal BsmAI | | Hinfl |Hpyl78IIl|| 

I I III I I Ml 

ACAAATAGTAACTATGCGCTACTTTCTGCCGCAGAGACTCTGACCTTCAAGAATTTTTCT 

361 + + + + + + 420 

TGTTTATCATTGATACGCGATGAAAGACGGCGTCTCTGAGACTGGAAGTTCTTAAAAAGA 



Drdll CviJI 

NlaIV| Haelll Cjel 

Bsp24l|| Fnu4HI | Mnll 

Cjel j | Taul | CjePI | 

CjePllj TaqI Acil| | Bsp24l|| 

III I II I Ml 

TCTATAAACTTTACAGGGAACCAATCGACAGGACTTGGCGGCCTCATCTACGGAAAAGAT 

421 + + + + + + 480 

AGATATTTGAAATGTCCCTTGGTTAGCTGTCCTGAACCGCCGGAGTAGATGCCTTTTCTA 



Cjel 
Bsbl 

Dpnl Taal | 

Sau3AI | Bpml | j 

Mboll | | Pflll08I | j j 

I II I I I I 

ATTGTTTTCCAATCTATCAAAGATTTGATCTTCACTACGAACCGTGTTGCCTATTCTCCA 

481 + + + + + + 540 

TAACAAAAGGTTAGATAGTTTCTAAACTAGAAGTGATGCTTGGCACAACGGATAAGAGGT 



Maelll 
Sf cl 
Faul 
Sthl32I ! 
Acil | : 

Mwol | III | CviJI 

Cjel || |j | Taal Sfcl NlaIV| 

I I I I I II II I I I M 

GCATCTGTAACTACGTCGGCAACTCCCGCAATCACTACAGTAACTACAGGAGCCTCTGCT 

541 + + + + + + 600 

CGTAGACATTGATGCAGCCGTTGAGGGCGTTAGTGATGTCATTGATGTCCTCGGAGACGA 



Mae 1 1 
SfaNI | 
Maelll | j 
AlwNI | | | 
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Fig. 26 (con't) 



Mmel Dpnl 

Sfcl TspRI Sau3AI | 

Plel| Taql| Clal | j 

Mnll I iHinfl Taal | TaqI | | 

I II I I II HI 

CTCCAACCTACAGACTCACTCACTGTCGAAAACATATCCCAATCGATCAAGTTTTTTGGG 

601 + + + + + + 660 

GAGGTTGGATGTCTGAGTGAGTGACAGCTTTTGTATAGGGTTAGCTAGTTCAAAAAACCC 

Beef I 

Tsp509I Apol | 

CviRl| BsaJI Tsp509I | 

NlalV CviJI | BstDSI Msel | j 

| I I I I >>l 

AACCTTGCCAACTTCGGCTCTGCAATTAGCAGTTCTCCCACGGCAGTCGTTAAATTCATC 

661 + + + + + + 720 

TTGGAACGGTTGAAGCCGAGACGTTAATCGTCAAGAGGGTGCCGTCAGCAATTTAAGTAG 

BsaXI 

Alul Fnu4HI 

CviJI Hpyl7 8III Taul 

Acil Nlalll | Mnll | Acil| Mnll 

I I I I II I 

AATAACACCGCTACCATGAGCTTCTCCCATAACTTTACTTCGTCAGGAGGCGGCGTGATT 

721 + + + + + + 780 

TTATTGTGGCGATGGTACTCGAAGAGGGTATTGAAATGAAGCAGTCCTCCGCCGCACTAA 

Nsil 

HaeIV| 

Hin4I j 

HphI | 

CviRI | 

MboII 

Hpyl78III | 

Alul Tsp509I | j 

CviJI SfaNI | | | 

| I I I I 

TATGGAGGAAGCTCTCTCCTTTTTGAAAACAATTCTGGATGCATCATCTTCACCGCCAAC 

7B1 + + + + + 840 

ATACCTCCTTCGAGAGAGGAAAAACTTTTGTTAAGACCTACGTAGTAGAAGTGGCGGTTG 



Acil 
Fokl | 
SfaNI j 
Cjel | | 
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Maelll 
Tsp4 5I 
BsaHI 
Cjel 
Mwol 



HphI 
Msel 
Alul | 
CviJI | 
Hgal | | 



NlalV 
Hpyl78III | 
BslI | I 
EcoNI | j I 
I I I I 

TCCTGTGTGAACAGCTTAAAAGGCGTCACCCCTTCATCAGGAACCTATGCTTTAGGAAGT 

841 + + + + + + 

AGGACACACTTGTCGAATTTTCCGCAGTGGGGAAGTAGTCCTTGGATACGAAATCCTTCA 



900 



CviRI 
Sthl32I 
Bed 
CviJI 
NlalV 



Ecil 
Acil | 
Fokl | | 
III 



BscGI 
Sf aNI | 
II 



Msel 
Tsp509I | 
NspV | | 
Taql | | 



BseSI 
CviRI 
ApaLI | 



901 



GGCGGAGCCATCTGCATCCCTACGGGAACTTTCGAATTAAAAAACAATCAGGGGAAGTGC 

! + + + -\ h 

CCGCCTCGGTAGACGTAGGGATGCCCTTGAAAGCTTAATTTTTTGTTAGTCCCCTTCACG 



960 



Faul 
Sthl32I | 

BsiHKAI Rsal | j 

Bspl286I SfaNI j | 

| I II I I I I 

ACCTTCTCTTATAATGGTACACCAAATGATGCGGGTGCGATCTACGCCGAAACCTGCAAC 



Mwol 
Dpnl | 
Acil Sau3AI I 



CviRI 



961 



■+ 1020 



TGGAAGAGAATATTACCATGTGGTTTACTACGCCCACGCTAGATGCGGCTTTGGACGTTG 



NlalV 
BanI 
ScrFI 
BsaJI 



Drdll | 
Pf 111081 EcoRIlj 
BspMI | NlalV | j 
I M 



Bfal 



PstI 
TspRI 
Fnu4HI 
CviRI 
Tsel 
BtsI | 
Sfd j 
Bsbl | j 
I 



NlalV 
Ecil | 
Acil | j 
Bbvl | | | 
Ml I 



1021 



ATCGTAGGGAACCAGGGTGCCTTGCTCCTAGATAGCAACACTGCAGCGAGAAATGGCGGA 

+ + + + + + 1080 

TAGCATCCCTTGGTCCCACGGAACGAGGATCTATCGTTGTGACGTCGCTCTTTACCGCCT 
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Avail Hpyl78III 

Sau96I Apol 

Sspl Acil | EcoRI 

Bed BsiHKAI | Thai j Tsp509I Bfal 



CviJI |MwoI Bspl2 86I | Tthllll |HgaI | Xbal 

I II i I I I I 



Acil 
S 

GCCATCTGTGCTAAAGTGCTCAATATTCAAGGACGCGGTCCTATTGAATTCTCTAGAAAC 



1081 + + + + + + H40 

CGGTAGACACGATTTCACGAGTTATAAGTTCCTGCGCCAGGATAACTTAAGAGATCTTTG 

NlalV 

Acil CviJI | 

Hhal Mmel Haelll j BslI 

Thai Alul | Eco0109l|| Bsal | Mnll 

Thai I CviJI j Sau96I | |BsmAI j SimI 

I I II III I I I 
CGCGCGGAGAAGGGTGGAGCTATTTTCATAGGCCCCTCTGTTGGAGACCCTGCGAAGCAA 

1141 + + + + + + 1200 

GCGCGCCTCTTCCCACCTCGATAAAAGTATCCGGGGAGACAACCTCTGGGACGCTTCGTT 

BsaJI Nlalll 
Hpyl88IX Styl Nspl 

Taql Tthlllll CviJI | HphI | Cjel | 

I ill M ii 

ACATCGACACTTACGATTTTGGCTTCCGAAGGTGATATTGCGTTCCAAGGAAACATGCTC 

1201 + + + + + + 1260 

TGTAGCTGTGAATGCTAAAACCGAAGGCTTCCACTATAACGCAAGGTTCCTTTGTACGAG 

Hinfl Hin4I 

Tfil Cjel TspRI Hin4I | 

ScrFI | Bed | Taal | BsaXI | j 

EcoRII | | Acil BsrDl|Sfcl| j Bsgl | | j 

I I I I II II I MM 

AATACAAAACCTGGAATCCGCAATGCCATCACTGTAGAAGCAGGGGGAGAGATTGTGTCT 

1261 + + + + + + 1320 

TTATGTTTTGGACCTTAGGCGTTACGGTAGTGACATCTTCGTCCCCCTCTCTAACACAGA 

Dpnl 

CviRI Maell Sau3AI | CviJI 

BsmAI Mnll CviJI | Alwl | j BsaXI | 

II II III M 
CTATCTGCACAAGGAGGCTCACGTCTTGTATTTTATGATCCCATTACACATAGCCTCCCA 

1321 + + + + + + 1380 

GATAGACGTGTTCCTCCGAGTGCAGAACATAAAATACTAGGGTAATGTGTATCGGAGGGT 
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Alwl 
HaelV 
Hin4I 
Sfcl 
AlwNI 
Dpnl 
BstYI 



Mnll 



Eco57I 



Sau3AI 
Hpyl78III 
Haell | 
Hhal | | 
Mwol | | j 
III I 



ACCACAAGTCCGTCTAATAAAGACATTACAATCAACGCTAATGGCGCTTCAGGATCTGTA 



1381 



TGGTGTTCAGGCAGATTATTTCTGTAATGTTAGTTGCGATTACCGCGAAGTCCTAGACAT 



■+ 1440 



Cac8I Bsbl 



BseRI BsmFI 
Plel|HinfI Sfcl |MnlI 

II I III I I 

GTCTTTACAAGTAAGGGACTCTCCTCTACAGAACTC CTGTTG C CTG CCAAC ACGACAACT 



1441 



1500 



CAGAAATGTTCATTCCCTGAGAGGAGATGTCTTGAGGACAACGGACGGTTGTGCTGTTGA 



Dpnl 
Sau3AI 
Hpyl7 8III 
Taal | 
CjePI | | 
Bfal || || |BfaI 
II I I 



Eco57I 
Acil | 
CjePI MboII | | 
I III 



ATACTTCTAGGAACAGTCAAGATCGCTAGTGGAGAACTGAAGATTACTGACAATGCGGTT 



1501 



-+ 1560 



TATGAAGATCCTTGTCAGTTCTAGCGATCACCTCTTGACTTCTAATGACTGTTACGCCAA 



Hpyl78III 
Banll 
Bspl286I 
CviJI 
Mnll 
ScrFI 
BsaJI 
BsaJI 

Banll BseMII 
Bspl286I EcoRII 
Bpul0l| Alul | 

Ddel | CviJI j 
Ddel CviJI | | BseMII | jl! II I I NlalV 

i i i i in mi 

GTCAATGTTGCTGGCTTCGCTACTCAGGGCTCAGGTCAGCTTACCCTGGGCTCTGGAGGA 

1561 + + + + + + 1620 

CAGTTACAACGACCGAAGCGATGAGTCCCGAGTCCAGTCGAATGGGACCCGAGACCTCCT 



Mwol 
CviJI | 
Cac8I | 
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AccI 
BspMI 
Sf cl 
MspAlI 
Acil 
Fnu4HI 
Taul 
Mwol 



Taqll 
Aarl 
BsiHKAI 
Bspl286I 
BscGI 
BsaJI 



BstDSI 
Bsbl 
Bpml 



Sthl32I 
Bsu36I Cac8I | 
Ddel CviJI I 



1621 



ACCTTAGGGCTGGCAACACCCACGGGAGCACCTGCCGCTGTAGACTTTACGATTGGAAAG 

+ + + + + + 1680 

TGGAATCCCGACCGTTGTGGGTGCCCTCGTGGACGGCGACATCTGAAATGCTAACCTTTC 



Dpnl 
Sau3AI | 

Taql| I Cac8I 

Alwl | | | <^riRI | 

Bsmi | | | sf aNI I I 

I III III 
TTAGCATTCGATCCTTTTTCCTTCCTAAAAAGAGATTTTGTTTCAGCATCAGTAAATGCA 

1681 + + + + + 

AATCGTAAGCTAGGAAAAAGGAAGGATTTTTCTCTAAAACAAAGTCGTAGTCATTTACGT 



+ 1740 



Maelll 
Tsp45I 
Maell | 



Msel 



Hpyl78III 

Drdll 
Alol 
Ppil 
Banll 



BsaXI 
BsiHKAI 
Bspl286I 
Sad 
Alul | 
CviJI | 
I I 



Dpnl 

Maelll Bglll | 
Maell| BstYI j 
Nlalll | | Sau3AI | 



1741 



GGCACAAAAAACGTCACTTTAACAGGAGCTCTGGTTCTTGATGAACATGACGTTACAGAT 

+ + + + + + 1800 

CCGTGTTTTTTGCAGTGAAATTGTCCTCGAGACCAAGAACTACTTGTACTGCAATGTCTA 
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Bpml 
HaelV 
Hin4I 



Tsp509I 



Bsrl 



1801 



CTTTATGATATGGTGTCATTACAATCTCCAGTAGCAATTCCTATCGCTGTTTTCAAAGGA 

+ + + + + + 1860 

GAAATACTATACCACAGTAATGTTAGAGGTCATCGTTAAGGATAGCGACAAAAGTTTCCT 

CviJI 
Bgll 

Hinfl Mwol 
Bed Hin4I | Cjel | 

Hpyl78III I Plel | I CviJI | j 

, , . I I I I I III 

GCAACCGTTACTAAGACAGGATTTCCTGATGGGGAGATTGCGACTCCAAGCCACTACGGC 



Maelll 
Taal 
BsaXI | Ddel 



1861 



CGTTGGCAATGATTCTGTCCTAAAGGACTACCCCTCTAACGCTGAGGTTCGGTGATGCCG 



•+ 1920 



Tsp509I 
Sthl32I 
BscGI 



BsaJI 
Styl 
I 



BsmFI 
Avail 
Sau96I 
Bcefl | 
BsmFI | 
I I 



Cjel 
NlalV 
Avail 
Nlalll 
Sau96I 
BslI | 
I I 



Msel 



Acelll 
BsmFI 
BslI 
Bed 
ECONI 
Hpyl78III 
AlwNI | 
Mnll j 
Alul | | 
CviJI I 



1921 



TACCAAGGAAAGTGGTCCTACACATGGTCCCGTCCCCTGTTAATTCCAGCTCCTGATGGA 

H + + + + + 

ATGGTTCCTTTCACCAGGATGTGTACCAGGGCAGGGGACAATTAAGGTCGAGGACTACCT 



1980 



NlalV 
Avail 
ECOO109I 
PspSII 

Sau96I j Bpml Hpyl88IX 

ScrFI | j Hhal | Apol | 

EcoRII | | | Mnll | | ECORI | 

Mnll || || Bfal | | | Tsp509I | 

Mill III! II 

GGATTTCCTGGAGGTCCCTCTCCTAGCGCAAATACTCTCTATGCTGTATGGAATTCAGAC 

1981 + ---- + + + + + 2040 

CCTAAAGGACCTCCAGGGAGAGGATCGCGTTTATGAGAGATACGACATACCTTAAGTCTG 
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BssSI 
MslI 



Sthl32I 
Maelll 
Hpyl78III 
Aval 
Dpnl 



BstYI 
Sau3AI 
Ddel | 
Alwl | | 
I 



Cjel 
BsaXI | 
Hin4I | 
Tsp509I | | 

I i I 



ACTCTCGTGCGTTCTACCTATATCTTAGATCCCGAGCGTTACGGAGAAATTGTCAGCAAC 



2041 



TGAGAGCACGCAAGATGGATATAGAATCTAGGGCTCGCAATGCCTCTTTAACAGTCGTTG 



-+ 2100 



Alul Hpyl88IX 
CviJI Ddel Cjel BsmI | Fokl 

I | I I I I 

AGCTTATGGATTTCCTTCTTAGGAAATCAGGCATTCTCTGATATTCTCCAAGATGTTCTT 



2101 



2160 



TCGAATACCTAAAGGAAGAATCCTTTAGTCCGTAAGAGACTATAAGAGGTTCTACAAGAA 



Dpnl 
HaeIV| Neil 
Hin4l| ScrFI 
Sau3AI | j Aval | 
Sthl32I | | BsaJI j 
II 



Sthl32I 
Neil 
ScrFI 
Smal 
Mspl 



Alul 
CviJI 
Hindi I I | 
Thai | j 
Acil | | | 
I I 



BsaXI 
Hin41 
CviJI | 
NlaIV| | 
Mwol | | j TaqI 
I II I I 



TTGATAGATCATCCCGGGTTGTCCATAACCGCGAAAGCTTTAGGAGCCTATGTCGAACAC 



2161 



-+ 2220 



AACTATCTAGTAGGGCCCAACAGGTATTGGCGCTTTCGAAATCCTCGGATACAGCTTGTG 



Mnll CviJI 
BslI | Nlalll | 
I I 



CviJI 
Mwol | 
Mnll Bbvl | | 
I 



Fnu4HI 

Alul | 
CviJI j 

Tsel |HhaI 

II I 



ACACCAAGACAAGGACATGAGGGCTTTTCAGGTCGCTATGGAGGCTACCAAGCTGCGCTA 



2221 



-+ 2280 



TGTGGTTCTGTTCCTGTACTCCCGAAAAGTCCAGCGATACCTCCGATGGTTCGACGCGAT 

Alul 
CviJI 

Avail Fnu4HI | 

Sau96I Tsel | j 

Cjel || | Bbvl 

I I I M I I 

TCTATGAACTACACGGACCACACTACGTTAGGACTTTCTTTCGGGCAGCTTTATGGAAAA 



Cjel 



Maell Sthl32I 



2281 



-+ 2340 



AGATACTTGATGTGCCTGGTGTGATGCAATCCTGAAAGAAAGCCCGTCGAAATACCTTTT 
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Hinf I 
Tfil Hpyl88IX 
Pf 111081 |MaeII Ddel | BseMII 

III I I I 

ACTAACGCCAACCCCTACGATTCACGTTGCTCAGAACAAATGTATTTACTCTCGTTCTTT 



2341 



-+ 2400 



TGATTGCGGTTGGGGATGCTAAGTGCAACGAGTCTTGTTTACATAAATGAGAGCAAGAAA 



Tsp509I 
I 



Hinf I 
Hpyl78III 
Maelll | 
Tsp45I | 
Plel | | 
I I 



ScrFI 
EcoRII 
Tsp509I 
Msel 



Mnll 



CviJI 
Hael 
Haelll 
Stul 



Alul 
CviJI 
Fnu4HI | 
Tsel| | 
II I 



GGTCAATTCCCTATCGTGACTCAAAAGAGCGAGGCCTTAATTTCCTGGAAAGCAGCTTAT 

2401 + + + + + + 2460 

CCAGTTAAGGGATAGCACTGAGTTTTCTCGCTCCGGAATTAAAGGACCTTTCGTCGAATA 



Alul 
CviJI 
BseMII | 

HphI Hpyl88IX Bce83I | | 

Bbvl I Ddel | Mnll | jj Smll 

|| I I II II I 

GGTTATTCCAAAAATCACCTAAATACCACCTACCTCAGACCTGACAAAGCTCCAAAATCT 



2461 



CCAATAAGGTTTTTAGTGGATTTATGGTGGATGGAGTCTGGACTGTTTCGAGGTTTTAGA 



-+ 2520 



PstI 
CviRI | 

BsrDI Maelll Fokl Sfcl | j Alol 

I I I I I I I 

CAAGGGCAATGGCATAACAATAGTTACTATGTTCTTATTTCTGCAGAACATCCTTTCCTA 



2521 



-+ 2580 



GTTCCCGTTACCGTATTGTTATCAATGATACAAGAATAAAGACGTCTTGTAGGAAAGGAT 



Bbsl 

MboII BsrI Bce83I 



Dpnl 
BstYI 
Sau3AI 
Alul 
CviJI 
Hindlll 
Mnll 
Smll | 
CviJI | | 



Alwl 
I 



Acil 
I 



AACTGGTGTCTTCTTACAAGACCTCTGGCTCAAGCTTGGGATCTTTCAGGTTTTATTTCC 



2581 



-+ 2640 



TTGACCACAGAAGAATGTTCTGGAGACCGAGTTCGAACCCTAGAAAGTCCAAAATAAAGG 
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Bfal 
Avrll 
BsaJI 
Apol | 
EcoRI j 
Tsp509I Styl 



Maell 
CviRI 
Dpnl 
Bglll 



Alul 
CviJI 

Cjel AlwNI | | | |Cjel| 

II 

GCAGAATTCCTAGGTGGTTGGCAAAGTAAGTTCACAGAAACTGGAGATCTGCAACGTAGC 



BsrI 
BstYI 
Sau3AI 
AlwNI I 



2641 



-+ 2700 



CGTCTTAAGGATCCACCAACCGTTTCATTCAAGTGTCTTTGACCTCTAGACGTTGCATCG 



Drdll 
Fokl I 



Bpml 

Mnll I Rsal MboII 

li I I II 

TTTAGTAGAGGTAAAGGGTACAATGTTTCCCTACCGATAGGATGTTCTTCTCAATGGTTC 



2701 



-+ 2760 



AAATCATCTCCATTTCCCATGTTACAAAGGGATGGCTATCCTACAAGAAGAGTTACCAAG 

NlalV Bed BsaBI 

Msel CviJI I TspRlj CviJI EcoRV | 

III II III 

ACACCATTTAAGAAGGCTCCTTCTACACTGACCATCAAACTTGCCTACAAGCCTGATATC 



2761 



-+ 2820 



TGTGGTAAATTCTTCCGAGGAAGATGTGACTGGTAGTTTGAACGGATGTTCGGACTATAG 



Maelll 
Tsp45I 
Mnll | 
Sspl | | 



Ddel 
Dpnl 
Sau3AI 
Taql| 
BsiHKAI | j 
Bspl286I j | 



PshAI BsmAI 
Hindi Sspl || Taal| BsmBI 

.11 I II 

TATCGTGTCAACCCTCACAATATTGTGACTGTCGTCTCAAACCAAGAGAGCACTTCGATC 



2821 



-+ 2880 



ATAGCACAGTTGGGAGTGTTATAACACTGACAGCAGAGTTTGGTTCTCTCGTGAAGCTAG 



BsaJI 
BstDSI 
Tthlllll 
Acil | 
Fnu4HI | 
BseMII | | 
Cjel |Taul| 



Mnll 
Dpnl ] 
Bglll 

Rsal BstYI 

BsrGI | Cjel Sau3AI 

Hpyl78III Cjel |TauI| |TaaI TatI j Nlalll HphI | 

■ | || III 

TCAGGAGCAAATCTACGCCGCCACGGTTTGTTTGTACAAATCCATGATGTAGTAGATCTC 

2881 + + + + + + 2940 

AGTCCTCGTTTAGATGCGGCGGTGCCAAACAAACATGTTTAGGTACTACATCATCTAGAG 
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CviJI 

Hael Bsp24I 

Haelll Sthl32I BscGI CjePI 

BsaJI Ddel Stul BseMII | BslI | Cjel| 

I II II M M 

ACCGAGGACACTCAGGCCTTTCTAAACTATACCTTTGACGGGAAAAATGGATTTACAAAC 



2941 



3000 



TGGCTCCTGTGAGTCCGGAAAGATTTGATATGGAAACTGCCCTTTTTACCTAAATGTTTG 



Sfcl Cjel Alul 

AccI | CjePI Dral CviJI 

Drain I Bsp24l| Msel | Ddel | Ddel 

I II II Mill 

CACCGAGTGTCTACAGGACTAAAATCCACATTTTAAAACTCTAAGCTCTGCTTAGAGTTT 



3001 



3060 



GTGGCTCACAGATGTCCTGATTTTAGGTGTAAAATTTTGAGATTCGAGACGAATCTCAAA 



Hinf I 
Ddel 
Sthl32I 
BsiEI 



Mspl 
Neil 
ScrFI 
BsaJI | 
CviJI | | 
Sfcl | | | 



Tfil 



TaqI 
Mnll | 
Bed | | 
I I I 



CviJI 
Hael 
Hael IT 
BsrDI | 
Mwol | | 



Ddel MboII 



I II I 



TCTGTAGCCCCGGTCGTCTTAGAATCCTCTATCCATCATCGAAGAACTTAGCAATGAAGG 



3061 



■+ 3120 



AGACATCGGGGCCAGCAGAATCTTAGGAGATAGGTAGTAGCTTCTTGAATCGTTACTTCC 



Hin4I 
Hinf I | 
Tfil 



CCAAGATTCTCACTCTATGAGAACCCCCCC 

3121 + + + 3150 

GGTTCTAAGAGTGAGATACTCTTGGGGGGG 
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1 
51 
101 
151 
201 
251 
301 
351 
401 
451 
501 
551 
601 
651 
701 
751 
801 



MKIPLRFLLI 
SATDGTNYVF 
FSNIDATTAS 
KGNLSLLDND 
GGAIHTKNLT 
I IFEGNTIGA 
S VADALN INS 
FKNGTWLKG 
IDSLRNGKKI 
I LELDAGKD I 
SFNPTAEQEA 
NVLHRSGREN 
MNTNFAKTYA 
YGQLSYGHTD 
SGRGFFQEYT 
LEKRFAEQYY 
IVQASGFRSL 



SLVPTLSMSN 
KDSWIENVP 
GAAIGSEAAN 
KVLIQDNFST 
LSSGGETLFQ 
TGTVSHSAID 
PDTGDNKEYT 
DWLSANGFS 
KLSAATAQKD 
VISADSRSID 
PLVPNLLWGS 
QRKFRHVSGG 
GSLRLQHDAS 
HRMKTESLPP 
PFVKVQAVYA 
HWAMYSPDV 
GAAAELFGNF 



LLGAATTEEL 
KTGETQSTSC 
KTVTLSGFSA 
GDGGAINCAG 
GNTAPTAAGK 
LGTSAKITAL 
GTIVFSGEKL 
QDANSKLIMD 
IRIDRPWLA 
AVQSPYGYQG 
FIDVRSFQNF 
AWGASTRMP 
LYSWSILLG 
PPPTLSTDHT 
RQDSFVELGA 
CRSNPKCTTT 
GFEWRGSSRS 



SASNSFDGTT 
FKNDAAAGDL 
LSFLKS PAST 
SLKIANNKSL 
GGAIAIADSG 
RAAQGHT I Y F 
TEAEAKDEKN 
LGTS LVAMTE 
ISDESFYQNG 
KWTINWSTDD 
IELGTEGAPY 
GGDTLSLGFA 
EGGLREILLP 
SWGGYVWAGE 
ISRDFSDSHL 
LLSNQGSWKT 
YNVDAGSKIK 



STTSFSSKTS 
NFLGGGFSFT 
VTNGLGAINV 
SFIGNSSSTR 
TLSISGDSGD 
YDPITVTGST 
RTSKLLQNVA 
SIELTNLEIN 
FLNEDHS YDG 
KKATVSWAKQ 
EKRFWVAGIS 
QLFARDKDYF 
YVSKTLPCSF 
LG TR V A VENT 
YNLAIPLGIK 
KGSNLARQAG 
F 



Possible T cell epitope: 
516 LLWGSFIDV 
Possible B cell epitope: 
554 HRSGRENQRKFRHV 
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1 MPPLNADDVL PRDHLSDGSF 
51 LKIIITLVLL RNALGVQQTP 
101 EIEANTIPQS LFTAEGAETV 
151 SQKTFPSEIR AHLTASDFVI 
2 01 TANVLVAMQM MMLSPLSISL 



Possible T cell epitope: 



188 VLIYLPFFV 



Possible B cell epitope: 
125 NKSKEPLR 



SDTYPDITTQ 
PSQVLNGIAL 
FVALNKSKEP 
IIPAFIMGQI 
PLKLLLIVMV 



AIILIFLALS 
ILSIYVMFPT 
LRSFLIRNTP 
KNAFEIGVLI 
DGWTLLLQGL 



PFLVMLLTSY 
GVAMYKDARK 
KAQIQSFYKI 
YLPFFVIDLV 
MISFK 
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Figure 29: CPN100422 

SPEAFSAFLD AKELLEKTKA DSEAYVAETE 
SWSKQIAFLE EETKNLRIRV REALVPLAIA 
ISQALKELTQ NKHIIISVNP KDLPLVEKSR 
VTPGGCIIET EAGIINAQLD VQLDALEKAF 
DSSSLSNDQD KKE 



Possible T cell epitope: 
163 LILTAKPDV 



Possible B cell epitope: 
22 6 SNDQDKKE 



1 MKFFSLIFKD DDVSPNKKVL 
51 QKCAQIRQEA KDQGFKEGSE 
101 SVRKIIGKEL ELHPETIVSI 
151 PELKNIVEYA DSLILTAKPD 
2 01 STILKAKNPV DEPSETSSST 
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Figure 30: CPN100424 

1 MTLLCCTSCN SRSLIVHGLP GREANEIWL LVSKGVAAQK LPQAAAATAG 

51 AATEQMWDIA VPSAQITEAL AILNQAGLPR MKGTSLLDLF AKQGLVPSEL 

101 QEKIRYQEGL SEQMASTIRK MDGWDASVQ I S FTTENEDN LPLTASVYIK 

151 HRGVLDNPNS IMVSKIKRLI ASAVPGLVPE NVSWSDRAA YSDITINGPW 

201 GLTEEIDYVS VWGIILAKSS LTKFRLIFYV LILILFVISC GLLWVIWKTH 

2 51 TLIMTMGGTK GFFNPTPYTK NALEAKKAEG AAADKEKKED ADSQGESKNA 

3 01 ETSDKDSSDK DAPEGSNEIE GA 



Possible T cell epitope: 
201 GLTEEIDYV 
Possible B cell epitope: 

2 84 DKEKKEDADSQGESKNAETSDKDSSDKDAPEGSNEIE 
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Figure 31: CPN100426 

1 MTIRVRNLAY SVNKKKILDG VTFSLERGHI TLFVGKSGSG KTM I LRALAG 

51 LVQPTQGDIW IEGEAPALVF QQPELFSHMT VLGNCTHPQI HIKGRSTEEA 

101 REKAFELLHL LD I EEVAKNY PDQLSGGQKQ RVAIVRSLCM DKHTLLFDEP 

151 TSALDPFATA SFRHLLETLR DQELTVGLTT HDMQFVHSCL DRIYLIDQGT 
2 01 VAGVYD KRDG ELDSGHPLSK YIHSAQ 



Possible T cell epitope: 
14 5 LLFDEPTSA 
Possible B cell epitope; 
2 05 YDKRDGE 
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Figure 32: CPN100508 

1 MKRPFFTYLC IIFYGSCASL SLHAGLSFPE VRGATAAWH ADSGKVFYDK 

51 DIDAVIYPAS MTKIATALFI LKHYPTVLDT LIKVKQDAIA SITPQAKKQS 

101 GYRSPPHWLE TDGSTIQLHL REELLGWDLF HALLVCSAND AANVLAMACC 

151 GSVEKFMDKL NFFLKEEIGC THTHFNNPHG LHHPNHYTTT RDLISIMRCA 

2 01 LKEPPFRGVI STTSYKIGAT NLHGERILSP TNKLLLPGST YHYPPALGGK 
251 TGTTKTAGKN LIMAAEKNNR LLVTIATGYS GPVSDLYQDV IALCETVFNE 

3 01 PLLRKELVPP SDCLQLEIAN LGKLSCPLPE GLYYDFYASE DREPLSVSFI 
351 AHADAFPIEQ GDLLGHWVFY DDEGKKISSQ PFYAPCRFER TIKPWKLYMK 

4 01 RVFTSYRTYM SITMLLMYFR IRKHRKYKNL KHYSKI 



Possible T cell epitope: 
156 FMDKLNFFL 
Possible B cell epitope: 
4 22 RKHRKYKN 
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Figure 33: CPN100515 

1 MASNPILQIE DLSITLAKQR QQYPIVQSLS FTINEGQTLA IIGESGSGKS 

51 VSAHAILRLL PCPPFSVSGQ VNFQGHNLLT ASRSIQKKII GTEISMIFQN 

101 PQASLNPVFT IEQQFREIIH THLALTAEVA KEKMLYALEE TGFHDPRLCL 

151 NLYPHQLSGG MLQRICIAMA LLCSPKLLIA DEPTTALDVS VQYQILQLLK 

2 01 TLQKKTGMSL LIITHNMGW AE T ADD VL VL YAGRMVE CAP AVQMFHNPSH 
251 PYTRDLLASR PSLQPQQLGS FNPIPGQPPH YTAFPSGCRY HPRCSKILNR 

3 01 CSAEAPEIYP VREGHKVRVG CMTTNFPQPL IQATSLTKHY YKRSFWFQGK 
351 TIASRPVDDV SFSLYSRRAV GLIGESGSGK STLALALAGL LPLTSGFLTF 
401 NGTPIKLHSK HGRHQLRSQV RLVFQNPQAS LNPRKTILDS LGHSLLYHKL 
451 VPKEKVLATV REYLELVGLS EEYFYRYPHQ LSGGQQQRVS IARALLGVPQ 
501 LIICDEIVSA LDLSIQAQIL NMLAELQKKL SLTYLFISHD LAWRSFCTE 
551 VFIMYKGQIV EKGNTKRIFS DPQHPYTRML LNAQLPETPD QRQSKPIFQE 
601 YHKDSEESCS TGCYFYNRCP QKQEACKSEI IPNQGDAHHT YRCIH 



Possible T cell epitope: 

5 9 LLPCPPFSV 

Possible B cell epitopes: 

18 KQRQQY 
587 ETPDQRQSK 
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Figure 34: CPN100538 

1 MPGIEKAATT VAVPQD KS EE EKVKERLTKR ELTCEDLKDN GYTVNFEDIS 

51 ILELLQFVSK ISGTNFVFDS NDLQFNVTIV SHDPTSVDDL STILLQVLKM 

101 HDLKWEQGN NVL I YRNPHL SKLSTWTDS SLKETCEAW VTRVFRLYRR 

151 QPSAAVNIIQ PLLSHDAIVS ASEATRHVII SDIAGNVDKV SDLLAALDCP 

201 GTSVDMTEYE VKYANPAALV SYCQDVLGTL AEDDAFQMFI QPGTNKIFW 

2 51 SSPRLANKAE QLLKSLDVPE MAHTLDDPAS TALALGGTGT TSPKSLRFFM 

3 01 YKLKYQNGEV IANALQDIGY NLYVTTAMDE DFINTLNSIQ WLEVNNSIVI 

3 51 IGNQGNVDRV IGLLNGLDLP PKQVYIEVLI LDTSLEKSWD FGVQWVAXiGD 
401 EQSKVAYASG LLNNTGIATP TKATVPPGTP NPGSIPLPTP GQLTGFSDML 

4 51 NSSSAFGLGI IGNVLSHKGK SFLTLGGLLS ALDQDGDTVI VLNPRIMAQD 
501 TQQASFFVGQ TVPYQTIKYY IQETGTVTQN IDYEDIGVNL WTSTVAPNN 
551 WTLQIEQTI SELHSASGSL TPVTDKTYAA TRLQIPDGCF LVMSGHIRDK 
601 TTKWSGVPL LNSIPLIRGL FSRTIDQRQK RNIMMFIKPK VISSFEEGTR 
651 VTNKEGYRYN WEADEGSMQV APRHAPECQG PPSLQAESDF KIIEIEAQ 



Possible T cell epitope: 

5 0 SILELLQFV 

Possible B cel'l epitopes: 

15 QDKSEEEK 
626 DQRQKRN 
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Figure 35: CPN100557 

1 MSRKDNEVSL ARSIFNILSG TFCSRITGIF RE I AMATYFG ADPIVAAFWL 
51 GFRTVFFLRK ILGGLILEQA FIPHFEFLRA QSLDRAAFFF RRFSRLIKGS 
101 TIIFTLLIEA VLWVFFNNVE EGTYDMILLT MILLPCGIFL MMYNVNGALL 
151 HCGNKFFGVG LAPWVNIIW IFFVIAARHS DPRERIIGLS VALVIGFFFE 
2 01 WLITVPGVWK FLLEAKSPPQ EHDSVRALLA PLSLGILTSS IFQLNLLSDI 
251 CLARYVHEIG PLYLMYSLKI YQLPIHLFGF GVFTVLLPAI SRCVQREDHE 
301 RGLKLMKFVL TLTMSVMIIM TAGLLLLALP GVRVLYEHGL FPQSAVYAIV 
351 RVLRGYGAS I IPMALAPLVS VLFYAQRQYA VPLFIGIGTA LANIVLSLVL 
401 GRWVLKDVSG ISYATSITAW VQLYFLWYYS SKRLPMYSKL LWESIRRSIK 
4 51 VMGTTMLACM ITLGLNILTQ TTYVIFLNPL TPLAWPLSSI TAQAIAFLSE 
501 SCIFLAFLFG FAKLLRVEDL INLASFEYWR GQRGLLQRQH VMQDTQN 



Possible T cell epitope: 

111 VLWVFFNNV 

Possible B cell epitopes: 

1 MSRKDNE 

2 95 QREDHERG 
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Figure 36: CPN100622 

1 MKTSRNKQCK ITDPLSKSSF FVGALILGKT TILLNATPLS DYFDNQANQL 

51 TTLFPLIDTL TNMTPYSHRA TLFGVRDDTN QDIVLDHQNS IESWFENFSQ 

101 DGGALSCKSL AITNTKMQIL FLNSFAIKRA GAMYVDGNFD LSENHGSIIF 

151 SGNLSFPNAS NFADTCTGGA VLCSKNVTIS KNQGTAYFIN NKAKSSGGAI 

201 QAAIINIKDN TGPCLFFNNA AGGTAGGALF ANACRIENNS QPIYFLNNQS 

251 GLGGAIRVHQ ECILTKNTGS VIFNNNFAME ADISANHSSG GAIYCISCSI 

301 KDNPGIAAFD NNTAARDGGA ICTQSLTIQD SGPVYFTNNQ GTWGGAIMLR 

3 51 QDGACTLFAD QGDIIFYNNR HFKDTFSNHV SVNCTRNVSL TVGASQGHSA 

401 TFYDPILQRY TIQNSIQKFN PNPEHLGTIL FSSTYIPDTS TSRDDFISHF 

451 RNHIGLYNGT LALEDRAEWK VYKFDQFGGT LRLGSRAVFS TTDEEQSSSS 

501 VGSVININNL AINLPSILGN RVAPKLWIRP TGSSAPYSED NNPIINLSGP 

551 LSLLDDENLD PYDTADLAQP IAEVPLLYLL DVTAKHINTD NFYPEGLNTT 

601 QHYGYQGVWS PYWIETITTS DTSSEDTVNT LHRQLYGDWT PTGYKVNPEN 

651 KGDIALSAFW QSFHNLFATL RYQTQQGQIA PTASGEATRL FVHQNSNNDA 

701 KGFHMEATGY SLGTTSNTAS NHSFGVNFSQ LFSNLYESHS DNSVASHTTT 

751 VALQINNPWL QERFSTSASL AYSYSNHHIK ASGYSGKIQT EGKCYSTTLG 

8 01 AALSCSLSLQ WRSRPLHFTP FIQAIAVRSN QTAFQESGDK ARKFSVHKPL 

851 YNLTVPLGIQ SAWESKFRLP TYWNIELAYQ PVIiYQQNPEI NVSLESSGSS 

901 WLLSGTTLAR NAIAFKGRNQ IFIFPKLSVF LDYQGSVSSS TTTHYLHAGT 

951 TFKF 



Possible T cell epitope: 

119 ILFLNSFAI 

Possible B cell epitopes: 

2 KTSRNKQ 
64 7 NPENKG 
6 94 QNSNNDAK 
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Figure 37: CPN100626 

1 MQVFPKVTLS LDYSADISSS TLSHYLNVAS RMRFLTISDQ NRKIKEPLVS 
51 KTPPKFLFYL GNFTACMFGM TPAVYSLQTD SLEKFALERD EEFRTSFPLL 
101 DSLSTLTGFS PITTFVGNRH NSSQDIVLSN YKSIDNILLL WTSAGGAVSC 
151 NNFLLSNVED HAFFSKNLAI GTGGAIACQG ACT I T KNRG P LIFFSMRGLN 
201 NAS TGGETRG GAIACNGDFT ISQNQGTFYF VNNSWNWGG ALSTNGHCRI 
251 QSNRAPLLFF NNTAPSGGGA LRSENTTISD NTRPIYFK3SIN CGNNGGAIQT 
301 SVTVAIKNNS GSVIFNNNTA LSGSINSGNG SGGAIYTTNL SIDDNPGTIL 
351 FNNNYCIRDG GAICTQFLTI KNSGHVYFTN NQGNWGGALM LLQDSTCLLF 
4 01 AEQGNIAFQN NEVFLTTFGR YNAIHCTPNS NLQLGANKGY TTAFFDPIEH 
451 QHPTTNPLIF NPNANHQGTI LFSSAYIPEA SDYENNFISS SKNTSELRNG 
501 VLSIEDRAGW QFYKFTQKGG ILKLGHAASI ATTANSETPS TSVGSQVIIN 
551 NLAINLPSIL AKGKAPTL W I RPLQSSAPFT EDNNPTITLS GPLTLLNEEN 
601 RDPYDSIDLS EPLQNIHLLS LSDVTARHIN TDNFHPESLN ATEHYGYQGI 
651 WSPYWVETIT TTNNASIETA NTLYRALYAN WTPLGYKVNP EYQGDLATTP 
701 LWQSFHTMFS LLRSYNRTGD SDIERPFLEI QGIADGLFVH QNSIPGAPGF 
751 RIQSTGYSLQ ASSETSLHQK ISLGFAQFFT RTKEIGSSNN VSAHNTVSSL 
801 YVELPWFQEA FATS HS LAY G YGDHHLHAYI RHIKNRAEGT CYSHTLAAAI 
851 GCSFPWQQKS YLHLSPFVQA IAIRSHQTAF EEIGDNPRKF VSQKPFYNLT 
901 LPLGIQGKWQ SKFHVPTEWT LELSYQPVLY QQNPQIGVTL LASGGSWDIL 
951 GHNYVRNALG YKVHNQTALF RSLDLFLDYQ GSVSSSTSTH HLQAGSTLKF 



Possible T cell epitope: 

56 FLFYLGNFT 

Possible B cell epitopes: 

3 9 DQNRKIK 
5 97 NEENRDPYD 
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Figure 38: CPN100628 

1 MLLPFTFVLA NEGLQLPLET YITLSPEYQA APQVGFTHNQ NQDLAIVGNH 
51 NDFILDYKYY RSNGGALTCK NLLISENIGN VFFEKNVCPN SGGAIYAAQN 
101 CTISKNQNYA FTTNLVSDNP TATAGS LLGG ALFAINCSIT NULGQGTFVD 
151 NLALMKGGAL YTETNLSIKD NKGPIIIKQN RALNSDSLGG GIYSGNSLNI 
201 EGNSGAIQIT SNSSGSGGGI FSTQTLTISS NKKLIEISEN SAFANNYGSN 
251 FNPGGGGLTT TFCTILNNRE GVLFNNNQSQ SNGGAIHAKS IIIKENGPVY 
3 01 FLNNTATRGG ALLNLSAGSG NGSFILSADN GDIIFNNNTA SKHALNPPYR 
351 NAIHSTPNMN LQIGARPGYR VLFYDPIEHE LPSSFPILFN FETGHTGTVL 
401 FSGEHVHQNF TDEMNFFSYL RNTSELRQGV LAVEDGAGLA CYKFFQRGGT 
451 LLLGQGAVIT TAGTIPTPSS TPTTVGSTIT LNHIAIDLPS ILSFQAQAPK 
501 IWIYPTKTGS TYTEDSNPTI TISGTLTLRN SNNEDPYDSL DLSHSLEKVP 
551 LLYIVDVAAQ KINSSQLDLS TLNSGEHYGY QGIWSTYWVE TTTITNPTSL 
601 LGANTKHKLL YANWS PLGYR PHPERRGEFI TNALWQSAYT ALAGLHSLSS 
651 WDEEKGHAAS LQGIGLLVHQ KDKNGFKGFR SHMTGYSATT EATSSQSPNF 
701 SLGFAQFFSK AKEHESQNST SSHHYFSGMC XAKYSLQRVI RLSVSLAYMF 
751 TSEHTHTMYQ GLLEGNSQGS FHNHTLAGAL SCVFLPQPHG ESLQIYPFIT 
801 ALAI RGNLAA FQESGDHARE FSLHRPLTDV SLPVGIRASW KNHHRVPLVW 
851 LTEISYRSTL YRQDPELHSK LLISQGTWTT QATPVTYNAli GIKVKNTMQV 
901 FPKVTLSLDY SADISSSTLS H YLNVAS RMR F 



Possible T cell epitope: 

1 MLLPFTFVL 

Possible B cell epitopes: 

38 HNQNQ 

619 YRPHPERRG 

669 HQKDKNG 
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Figure 39: CPN100630 

1 MPLSFKSSSF CLLACLCSAS CAFAETRLGG NFVPPITNQG EEILLTSDFV 

51 CSNFLGASFS SSFINSSSNL SLLGKGLSLT FTSCQAPTNS NYALLSAAET 

101 LTFKNFSSIN FTGNQSTGLG GLIYGKDIVF QSIKDLIFTT NRVAYSPASV 

151 TTSATPAITT VTTGASALQP TDSLTVENIS QSIKFFGNLA NFGSAISSSP 

2 01 TAWKFINWT ATMS FSHNFT SSGGGVIYGG SSLLFENNSG CIIFTANSCV 
251 NSLKGVTPSS GTYALGSGGA ICIPTGTFEL KNNQGKCTFS YNGTPNDAGA 

3 01 IYAETCNIVG NQGALLLDSN TAARNGGAIC AKVLN I QGRG PIEFSRNRAE 
351 KGGAIFIGPS VGDPAKQTST LTILASEGDI AFQGNMLNTK PGIRNAITVE 

4 01 AGGEIVSLSA QGGSRLVFYD PITHSLPTTS PSNKDITINA NGASGSWFT 
451 SKGLSSTELL LPANTTTILL GTVKIASGEL KI TDNAWNV AGFATQGSGQ 
501 LTLGSGGTLG LAT PTG AP AA VDFTIGKLAF DPFSFLKRDF VSASVNAGTK 
551 NVTLTGALVL DEHDVTDLYD MVSLQSPVAI P I AVFKGATV TKTGFPDGE I 
601 ATPSHYGYQG KWSYTWSRPL LIPAPDGGFP GGPSPSANTL YAVWNSDTLV 
651 RSTYILDPER YGEIVSNSLW ISFLGNQAFS DILQDVLLID HPGLS ITAKA 
701 LGAYVEHTPR QGHEGFSGRY GGYQAALSMN YTDHTTLGLS FGQLYGKTNA 
751 NPYDSRCSEQ MYLLSFFGQF PIVTQKSEAL ISWKAAYGYS KNHLNTTYLR 
8 01 PDKAPKSQGQ WHNNSYYVLI SAEHPFLNWC LLTRPLAQAW DLSGFISAEF 
851 LGGWQSKFTE TGDLQRSFSR GKGYNVSLPI GCSSQWFTPF KKAPSTLTIK 
901 LAYKPD I YR V NPHNIVTWS NQESTSISGA NLRRHGLFVQ IHDWDLTED 
951 TQAFLNYTFD GKNGFTNHRV STGLKSTF 



Possible T cell epitope: 

936 GLFVQIHDV 

Possible B cell epitopes : 

281 KNNQGK 
34 5 SRNRAEK 
707 HTPRQGHE 
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